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Preface 


The material presented here can be divided into two parts. The first, some- 
times referred to as abstract algebra, is concerned with the general theory of 
algebraic objects such as groups, rings, and fields, hence, with topics that are 
also basic for a number of other domains in mathematics. The second centers 
around Galois theory and its applications. Historically, this theory originated 
from the problem of studying algebraic equations, a problem that, after various 
unsuccessful attempts to determine solution formulas in higher degrees, found 
its complete clarification through the brilliant ideas of E. Galois. To convert Ga- 
lois’s approach into a comprehensible theory, in other words, to set up Galois 
theory, has taken quite a period of time. The reason is that simultaneously sev- 
eral new concepts of algebra were emerging and had to be developed as natural 
prerequisites. In fact, the study of algebraic equations has served as a motivat- 
ing terrain for a large part of abstract algebra, and according to this, algebraic 
equations will be visible as a guiding thread throughout the book. 

To underline this point, I have included at the beginning a historical in- 
troduction to the problem of solving algebraic equations. Later, every chapter 
begins with some introductory remarks on “Background and Overview,” where 
I give motivation for the material that follows and where I discuss some of its 
highlights on an informal level. In contrast to this, the remaining “regular” sec- 
tions (some of them optional, indicated by a star) go step by step, elaborating 
the corresponding subject in full mathematical strength. I have tried to proceed 
in a way as simple and as clear as possible, basing arguments always on “true 
reasons,” in other words, without resorting to simplifying ad hoc solutions. The 
text should therefore be useful for “any” course on the subject and even for self- 
study, certainly since it is essentially self-contained, up to a few prerequisites 
from linear algebra. Each section ends with a list of specially adapted exercises, 
some of them printed in italics to signify that there are solution proposals in 
the appendix. 

On many occasions, I have given courses on the subject of this book, usually 
in units of two for consecutive semesters. In such courses I have addressed the 
“standard program” consisting of the unstarred sections. The latter yield a well- 
founded and direct access to the world of algebraic field extensions, with the 
fundamental theorem of Galois theory as a first milestone. Also let me point 
out that group theory has been split up into an elementary part in Chapter 1 
and a more advanced part later in Chapter 5 that is needed for the applications 
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of Galois theory. Of course, if preferred, Chapter 5 can be covered immediately 
after Chapter 1. Finally, the optional starred sections complement the standard 
program or, in some cases, provide a first view on nearby areas that are more 
advanced. Such sections are particularly well suited for seminars. 

The first versions of this book appeared in German as handouts for my stu- 
dents. They were later compiled into a book on algebra that appeared in 1993. 
I’m deeply indebted to my students and colleagues for their valuable comments 
and suggestions. All this found its way into later editions. The present English 
edition is a translation and critical revision of the eighth German edition of 2013. 
Here my thanks go to my colleague and friend Alan Huckleberry, with whom I 
discussed several issues of the English translation, as well as to Birkhauser and 
its editorial team for the smooth editing and publishing procedure. 


Miinster, May 2018 Siegfried Bosch 
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Introduction 


On the Problem of Solving Algebraic Equations 


The word algebra is of Arabic origin (ninth century AD) and means doing 
calculations on equations, such as combining different terms of the equation, or 
changing terms by suitable manipulations on both sides of the equation. Here an 
equation is meant as a relation between known quantities, so-called coefficients, 
and unknown quantities or variables, whose possible value is to be determined 
by means of the equation. In algebra one is mostly interested in polynomial 
equations, for example of type 


2a? + 3a7 + 7x — 10 = 0, 


where x stands for the unknown quantity. Such an equation will be referred to 
as an algebraic equation for x. Its degree is given by the exponent of the highest 
power of x that actually occurs in the equation. Algebraic equations of degree 1 
are called linear. The study of these or, more generally, of systems of linear 
equations in finitely many variables, is a central problem in linear algebra. 

On the other hand, algebra in the sense of the present book is about alge- 
braic equations of higher degree in one variable. In today’s language, this is the 
theory of field extensions together with all its abstract concepts, including those 
of group-theoretic nature that, in their combination, make possible a convenient 
and comprehensive treatment of algebraic equations. Indeed, even on an “ele- 
mentary” level, modern algebra is much more influenced by abstract methods 
and concepts than one is used to from other areas, for example from analy- 
sis. The reason becomes apparent if we follow the problem of solving algebraic 
equations from a historical viewpoint, as we will briefly do in the following. 

In the beginning, algebraic equations were used essentially in a practical 
manner, to solve certain numerical “exercises.” For example, a renowned prob- 
lem of ancient Greece (c. 600 BC — 200 AD) is the problem on the duplication 
of the cube. Given a cube of edge length 1, it asks to determine the edge length 
of a cube of double volume. In other words, the problem is to solve the algebraic 
equation x? = 2, which is of degree 3. Today the solution would be described 
by 7 = ¥/2. However, what is \/2 if only rational numbers are known? Since it 
was not possible to find a rational number whose third power is 2, one had to 
content oneself with approximate solutions and hence sufficiently good approx- 
imations of 2. On the other hand, the duplication of the cube is a problem of 
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geometric nature. Hence, it suggests to try a geometric solution if other com- 
putational methods do not work. On many occasions, we find in ancient Greece 
constructions by compass and straightedge, for example of Euclid, that rely on 
intersection points of lines and circles with objects of the same type. But by 
applying such a technique, it is still not possible to construct ¥/2, as we know 
today; cf. Section 6.4. Since constructions by compass and straightedge could 
not always lead to the desired solution, one also finds constructions in terms of 
more complicated curves in ancient Greece. 

Once it is accepted that for the solution of algebraic equations, say with 
rational coefficients, one needs the process of taking nth roots for variable n, 
besides the “rational” operations of addition, subtraction, multiplication, and 
division, we can pose the question whether a repeated application of these oper- 
ations will be sufficient for calculating the solutions from the coefficients. This 
is the fundamental question on the solvability of algebraic equations by radicals. 
For example, algebraic equations of degree 1 and 2 are solvable by radicals: 


z'+a=0 —> L=—-a, 


2 
e+arit+b=0 <> r= —5t4/>-0. 


The solvability of quadratic equations was basically already known to the Baby- 
lonians (from the end of the third millennium BC on), using elementary geo- 
metric methods, even if in specific examples that are conveyed, square roots 
were mainly taken from square numbers. From the ninth century AD on, after 
the Babylonian and the Greek periods had finished, the solution of quadratic 
equations was further refined by Arabian mathematicians. They also worked on 
the solvability of cubic equations and of equations of higher degree, however, 
without any noteworthy contribution to the subject. 

The sensational discovery that cubic equations are solvable by radicals was 
achieved only around 1515 by the Italian S. del Ferro. He considered an equation 
of type x? + ax = b for a,b > 0 and found as its solution 


Although he knew that before him, generations of mathematicians had worked 
on this problem without success, del Ferro kept his findings secret, without 
publishing them. However, we know about his work from the Ars Magna, some 
sort of schoolbook that was published by G. Cardano in 1545. Cardano had 
heard about del Ferro’s solution formula in an indirect way and then was able 
to work it out by himself. Furthermore, he realized that as a rule, equations of 
degree 3 should have three solutions. It is remarkable in his work that Cardano 
was less hesitant than his contemporaries to use negative numbers. Also, there 
are some first signs by him of the use of complex numbers. Finally, his student 
L. Ferrari discovered after 1545 that algebraic equations of degree 4 are solvable 
by radicals; see Section 6.1 for the corresponding formulas. 
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During the next two centuries, there was only little progress on the solvabil- 
ity of algebraic equations. F. Viete discovered the connection between the coef- 
ficients of an equation and its solutions, which carries his name. From today’s 
viewpoint this is a triviality if we use the decomposition of polynomials into 
linear factors. Furthermore, there was already a certain understanding about 
the multiplicities of solutions, including the idea that an algebraic equation of 
degree n should always have n solutions, counted with multiplicities, just as 
examples clearly show in ideal cases. However, it must be pointed out that the 
latter finding was only rather vague, since the nature of solutions, say real or 
complex, or even hypercomplex (neither real nor complex), was not made pre- 
cise. At that time, there were also several attempts, unsuccessful though, for 
example by G. W. Leibniz, to solve algebraic equations of degree 5 and higher 
by radicals. 

Finally, a certain consolidation of the situation was taking place by means of 
the fundamental theorem of algebra. The first ideas of its verification appeared 
in 1746 by J. d’Alembert, while further proofs of varying strength were carried 
out by L. Euler in 1749, by J. L. Lagrange in 1772, as well as later in 1799 by 
C. F. Gauss in his thesis. The theorem asserts that every nonconstant polyno- 
mial of degree n with complex coefficients admits precisely n complex zeros, 
counted with multiplicities, or in other words, that every such polynomial can 
be written as a product of linear factors. Even if the fundamental theorem of 
algebra did not directly contribute to the problem of solving algebraic equations 
by radicals, it nevertheless gave an answer to the question of where to look for 
solutions of such equations with rational, real, or complex coefficients. On this 
basis, further progress was achieved, particularly by Lagrange. In 1771 he sub- 
jected the solvability of algebraic equations of degree 3 and 4 to a complete 
revision and observed, among other things, that the cube roots in del Ferro’s 
formula must be chosen in accordance with the side condition 
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As a result, not nine possible values were obtained, but only the true solutions 
£1, £9, 13 of the equation x? + ax = b under consideration. However, more im- 
portant was the detection that on choosing a nontrivial third root of unity ¢, 
ie., a complex number ¢ # 1 satisfying ¢? = 1, the expression 


(x1 + x2 + C?a3)° 


takes only two different values on permuting the x; and thus must satisfy a 
quadratic equation (with coefficients from the considered number domain, for 
example the rational numbers). In this way, the sums Lm (1) + OLr(2) +C "rn (3); for 
any permutation 7, can be determined by solving a quadratic equation and sub- 
sequently extracting a cube root. In particular, since 71, #2, 73 can be calculated 
from these sums by means of rational operations, the solvability of the equation 
x + ax = b by radicals becomes clear. In a similar manner Lagrange charac- 
terized the solvability by radicals of algebraic equations of degree 4, in which 
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also in this case, permutations of the solutions play a major role. In this way, 
Lagrange introduced for the first time group-theoretic arguments into the dis- 
cussion, an approach that eventually led Galois to a complete characterization 
of the solvability by radicals for algebraic equations of arbitrary degree. 


Proceeding like Lagrange, Gauss studied, in 1796, the solutions of the equa- 
tion x? — 1 = 0 for prime numbers p > 2, relying on preparative work by 
A. T. Vandermonde. The corresponding permutations of the solutions give rise 
to groups again, “cyclic” ones in this case. Furthermore, the methods of Gauss 
led to new insight on the question of which regular polygons with a given number 
n of sides can be obtained in terms of compass and straightedge constructions. 
Around this time there were also studies by P. Ruffini, rendered more precise 
by N. H. Abel in 1820, showing that the “generic equation” of degree n is not 
solvable by radicals for n > 5. 


After such a number of partial results that were obtained mainly through 
a systematic application of group arguments, the time seemed to be ripe for a 
full clarification of the problem on the solvability of algebraic equations. This 
culminating step was successfully accomplished by E. Galois, with his brilliant 
ideas in the years 1830-1832. To a much greater extent than Abel, it was Galois 
who had very precise ideas about enlarging number domains, for example the 
rational numbers, by adding solutions of algebraic equations; from today’s point 
of view, it concerned a prestage of the notion of a field, as well as the technique of 
adjoining algebraic elements. Galois also introduced the notion of irreducibility 
for algebraic equations. Furthermore, he proved the primitive element theorem 
for the splitting field L of an algebraic equation f(x) = 0 with simple solutions, 
i.e., for the field generated by all solutions 21,...,x, of such an equation. The 
theorem asserts that there is an irreducible algebraic equation g(y) = 0 such that 
L contains all solutions y,,...,ys of this equation and, in addition, is obtained 
from the coefficient domain by adjoining any single one of the solutions y;. Now 
it was Galois’s idea to represent the x; in an obvious way as functions of y), say 
x; = hi(y,), and then to replace y, by an arbitrary element y;. As he showed, 
the elements hj(y;), i = 1,...,r, represent again all solutions of f(z) = 0. 
Furthermore, substituting y, by y; gives rise to a permutation 7, of the x;, and 
it follows that the 7; form a group, indeed, the “Galois group” of the equation 
f(a) = 0 as we say today. 


Based on these facts, Galois was led to the fundamental observation that 
the subfields of the splitting field L correspond in a certain way to the subgroups 
of the corresponding Galois group G, a result that we nowadays formulate in 
a refined way as the “fundamental theorem of Galois theory.” Finally, making 
use of this knowledge, Galois was able to show that the equation f(x) = 0 is 
solvable by radicals precisely when the group G admits a chain of subgroups 
G=G)D...D Gy, = {1}, where in each case, G;,1 is a normal subgroup of G; 
such that the factor group G;/Gj4, is cyclic. We could continue now discussing 
Galois theory in greater detail, but let us refer to the later Sections 4.1, 4.3, 
4.8, and 6.1 instead. 
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In any case, as we have seen, the delicate problem of solving algebraic equa- 
tions by radicals, which is quite easy to formulate, was fully clarified by Galois, 
due to his unconventional new ideas. In particular, one can now understand 
why, over many centuries, mathematicians were denied access to the problem. 
The solution does not consist of a comprehensible condition on the coefficients 
of the equation under consideration, say in terms of a formula. On the contrary, 
even to be formulated it requires a new language, more precisely, a new way of 
thinking in combination with new concepts, that could only be established in a 
long process of trial and error and of studying examples. Also we have to point 
out that the true benefit of Galois’s investigations does not concern so much his 
contribution to the solvability by radicals of algebraic equations, but instead, 
consists in the fundamental correspondence between algebraic equations and 
their associated “Galois” groups. Indeed, the fundamental theorem of Galois 
theory provides a means to characterize the “nature” of solutions of arbitrary 
algebraic equations in terms of group-theoretic properties. In view of this fact, 
the task of solving specific algebraic equations by radicals has largely lost its 
original significance. 


And how was Galois’s contribution perceived by his contemporaries? To 
give an impression, we take a brief look at Galois’s life; see also [11], Section 7. 
Evariste Galois was born in 1811 near Paris and died in 1832 at the age of only 
20 years. Already during his schooldays he looked at papers of Lagrange and 
wrote a first small treatise on continued fractions. Twice he tried to join the 
renowned Ecole Polytechnique in Paris, but was not able to pass the entrance 
examination, so that finally, he had to settle for the Ecole Normale. Here he 
began his studies in 1829, at the age of 18. In the same year he submitted a 
first Mémoire to the Académie des Sciences concerning the solution of algebraic 
equations. However, the manuscript did not receive any attention and was even 
lost, as was a second one that he submitted a week later. After another Mémoire 
had suffered the same fate in 1830, Galois made a final attempt in early 1831, 
submitting his paper on the solvability of algebraic equations by radicals that 
today is judged to be his most prominent work. This time it was refereed, but 
declined for reasons of immaturity and incomprehensibility. Disappointed that 
he could not find any recognition in mathematics, Galois turned his attention to 
the political events of his epoch. Due to his new activities he was several times 
arrested and eventually condemned to imprisonment. Finally, in May 1832 he 
was provoked to fight a duel, where he met his death. However, to preserve his 
work for posterity just in case he did not survive, Galois wrote a letter to a 
friend during the night before the duel, in which he put together his pioneer- 
ing discoveries in programmatic form. Although this program was published 
in 1832, the significance of Galois’s studies was not immediately recognized. 
One may speculate about the reason, but two facts are certainly responsible for 
this. Firstly, Galois was an unknown young mathematician, besides that with a 
dubious history. On the other hand, the characterization of the solvability of al- 
gebraic equations made such an inapproachable impression that no one among 
Galois’s contemporaries was prepared to accept this as a serious solution to 
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the problem. Also note that Lagrange, whose important preparative work was 
mentioned before, had died in 1813. 


We do not want to describe in full detail how Galois’s ideas eventually 
made their way to recognition and esteem. A major point is certainly the fact 
that J. Liouville, about 10 years after Galois’s death, came across his work 
and was able to publish a part of it in 1846. In fact, during the second half of 
the nineteenth century a phase began in which, among other things, one was 
concerned with understanding and polishing Galois’s ideas. Soon the problem 
of solvability of algebraic equations by radicals was reduced to its actual size. 
The problem was of extreme importance only because it had opened the door 
to an even more wide-ranging classification of irrational numbers, including 
transcendence aspects. Already in 1844 Liouville could establish the existence of 
transcendental numbers in a constructive way, a result that G. Cantor obtained 
more rigorously in 1874 using a countability argument. Similar studies of this 
type concern the transcendence proofs for e in 1873 by Ch. Hermite [8], as 
well as for 7 in 1882 by F. Lindemann [13]. Furthermore, some transcendence 
problems of general type were addressed in 1910 by E. Steinitz in his paper [15]. 


Through Galois’s work, it became apparent that focusing on single algebraic 
equations was somehow cumbersome. It was better to be flexible and to con- 
sider, so to speak, several equations at the same time, possibly also for different 
coefficient domains. This new insight led to the study of so-called algebraic field 
extensions, replacing single equations as considered before. The first to really 
follow this plan in Galois theory was R. Dedekind in his lectures 1855-1858 in 
G6ttingen. In particular, he interpreted Galois groups as automorphism groups 
of fields and not only as groups that permute the solutions of an algebraic equa- 
tion. Another significant improvement of the theory is due to L. Kronecker, who 
published in 1887 the construction principle for algebraic field extensions that 
is named after him. In this way, it became possible to set up Galois theory 
without relying on the fundamental theorem of algebra, and thereby to free it 
from the physical presence of complex numbers, for example in order to adapt 
it to finite fields. 


Taking into account all these developments, we are already quite close to 
the concepts that are still followed in the theory of field extensions today. Of 
course, there have been further completions, ameliorations, and simplifications 
that were essentially presented within the framework of books on the subject. 
Worth mentioning are—in historical order—the publications of H. Weber [17], 
B. L. van der Waerden [16], E. Artin [1], [2], as well as further pioneering books 
by N. Bourbaki [5] and S. Lang [12]. Even though the theory may nowadays 
seem to be “completed,” appearing in “optimal” shape, I would like to encourage 
the reader to remember from time to time the arduous journey the problem of 
solving algebraic equations has made from its beginnings on. Only if one bears in 
mind the enormous difficulties that had to be overcome will one understand and 
appreciate the fascinating solutions that mathematicians have found in difficult 
struggles over the course of centuries. 
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However, we do not want to give the impression that the investigation of 
algebraic equations has come to an end today. On the contrary, it has found its 
natural continuation in the study of systems of algebraic equations in several 
unknown quantities, within the fields of algebraic geometry, see [3], and number 
theory. Also concerning this setting, we can mention a problem that is easy to 
formulate, but which has resisted the attacks of mathematicians for a rather 
long period of time. It was solved only in the recent past, in the years 1993/94 
by A. Wiles with the help of R. Taylor. We are alluding to Fermat’s last theorem, 
a conjecture stating that the equation x” + y” = z” does not admit a nontrivial 
integer solution for n > 3. It is said that Fermat, around 1637, had noted this 
conjecture in the margin of his copy of Diophantus’s Arithmetica (c. 250 AD), 
adding that he had a truely marvelous demonstration for it, which, however, 
the margin was too narrow to contain. 


1. Elementary Group Theory M®) 
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Background and Overview 


There are two important reasons for considering groups in this book. On the 
one hand, the notion of a group exhibits a fundamental mathematical structure 
that is found, for example, in rings, fields, vector spaces, and modules, in which 
one interprets the inherent addition as a law of composition. All groups of 
this type are commutative or, as we also will say, abelian, referring to the 
mathematician N. H. Abel. On the other hand, there are groups originating 
from another source, such as the so-called Galois groups related to the work of 
E. Galois. These groups will be of central interest for us, serving as a key tool for 
the investigation of algebraic equations. From a simplified point of view, Galois 
groups are permutation groups, i.e., groups whose elements describe bijective 
transformations (self-maps) on sets like {1,...,n}. 


The main feature of a group G is its law of composition that assigns to a 
pair of elements g,h € G a third element goh € G, called the product or, in 
the commutative case, the sum of g and h. Such laws of composition are always 
around when one is doing calculations with numbers. But for a long time there 
was no need to pay special attention to the properties of these laws, since the 
latter were judged to be “evident.” Therefore, one can understand that up to 
the beginning of the seventeenth century, negative results from computations, 
for example from subtractions, were perceived as being “suspicious,” due to 
the fact that negative numbers did not have any precise meaning yet. However, 
from the nineteenth century on, the notion of a group began to take shape, 
notably when laws of composition were applied to objects that could not be 
interpreted as numbers anymore. For example, permutation groups played an 
important role in the attempts to solve algebraic equations. Since the related 
groups consist of only finitely many elements, it was still possible to formulate 
the group axioms without explicitly mentioning “inverse elements,” an approach 
that does not extend to the infinite case; cf. Exercise 3 of Section 1.1. An explicit 
postulation of “inverse elements,” and thereby an axiomatic characterization of 
groups from today’s point of view, emerged only at the end of the nineteenth 
century in the works of S. Lie and H. Weber. Prior to this, Lie, when studying 
his “transformation groups,” had still tried to derive the existence of inverse 
elements from the remaining group axioms, however without success. 
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In the present chapter we will explain some of the basics on groups, material 
that most readers will certainly be familiar with already. In addition to the 
definition of a group, we concentrate on normal subgroups, associated factor 
groups, as well as on cyclic groups. Already at this stage one can notice the 
lasting impact that the problem of solving algebraic equations, and in particular, 
Galois theory, have exercised on the development of groups. For example, the 
notion of a normal subgroup is strongly related to the fundamental theorem 
of Galois theory 4.1/6.1 Indeed, this theorem states, among other things, that 
an intermediate field F’ of a finite Galois extension L/K is normal over K in 
the sense of 3.5/5 if and only if the subgroup of the Galois group Gal(L/K) 
corresponding to Eis normal. Also note that referring to Proposition 1.2/3 as 
the theorem of Lagrange is inspired by group-theoretic arguments that Lagrange 
introduced when working on the solution of algebraic equations. 

More involved results on groups, and in particular, permutation groups, that 
are of special interest from the viewpoint of Galois theory, will be presented in 
Chapter 5. In addition, let us mention the fundamental theorem of finitely 
generated abelian groups 2.9/9, which provides a classification of such groups. 
Its proof will be carried out within the context of elementary divisors. 


1.1 Groups 


Let M be a set and M x M the Cartesian product with itself. An (inner) law 
of composition on M isamap M x M —+ M. In many cases we will write the 
image of a given pair (a,b) € M x M asa “product” a-b or ab. Thus, in terms 
of elements, the law of composition on M is characterized by the assignment 
(a,b) +> a- b. The law is said to be 

associative if (ab)c = a(bc) for all a,b,c € M, and 

commutative if ab = ba for alla,be M. 
An element e € M is called a unit element or a neutral element with respect to 
the law of composition on M if ea = a = ae holds for all a € M. Such a unit 
element e is uniquely determined by its defining property; usually we will write 
1 instead of e. A set M with a law of composition 0: M x M —> M is called 
a monoid if o is associative and M admits a unit element with respect to o. 

For a monoid M and elements aj,...,a, € M, the product 


n 
[ [ai ei ein ty 
i=1 


is defined. Note that a special bracketing on the right-hand side is not necessary, 
since the law of composition is assumed to be associative (use an intelligent 
inductive argument to prove this). Empty products are not excluded: we set 


oO 


II a; :=e = unit element. 
i=1 


' As an example, note that “4.1/6” refers to the sixth numbered item in Section 4.1. 
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For a € M and an exponent n € N, the nth power a” is defined in the usual 
way.” Note that a° = e, due to the convention on empty products. An element 
b € M is called an inverse of a given element a € M if ab =e = ba. Then 8, if 
it exists, is uniquely determined by a. Indeed, if ab! = e = b'a for some b’ € M, 
then 


b=eb=Vab=e=V. 


If an element a € M admits an inverse, it is denoted by a7!. 


Definition 1. A group is a monoid G such that every element of G admits 
an inverse. More explicitly, this means we are given a set G with a law of 
composition G x G —+ G, (a,b) —> ab, such that: 
(i) The law is associative, t.e., we have (ab)c = a(bc) for a,b,c € G. 
(ii) There exists a unit element, i.e., an element e € G such that ea = a = ae 
for all aeéG. 
(iii) Every element a € G admits an inverse, i.e., an element b € G such that 
ab =e = ba. 
The group is called commutative or abelian if the law is commutative, 1.e., 
if 
(iv) ab = ba for all a,beE G. 


Remark 2. In Definition 1 it is enough to require the following weaker condi- 
tions instead of (ii) and (iii): 

(ii’) There is a left neutral element in G, i.e., an element e € G satisfying 
ea=a forallaeG. 

(iii’) For each a € G there is a left inverse in G, i.e., an element b € G such 
that ba = e. 


For a verification of the fact that conditions (ii’) and (iii’) in conjunction 
with (i) are sufficient for defining a group, we refer to Exercise 1 below and to 
its solution given in the appendix. 

In dealing with abelian groups, the law of composition will usually be noted 
additively, i.e., we write a + b instead of a- b and }~ a; instead of [| a;, as well 
as n-a instead of an nth power a”. Accordingly, —a instead of a~! will denote 
the inverse of an element a, and 0 (zero element) instead of e or 1 will be the 
neutral element. Here are some examples of monoids and groups: 


(1) Z, Q, R, C, equipped with the usual addition, are abelian groups. 


(2) Q*, R*, C*, equipped with the usual multiplication, are abelian groups; 
the same is true for Qxy = {x € Q; x > 0} and Ryy = {x € R; x > O}. 
More generally, we can look at matrix groups from linear algebra like SL, and 
GL,, taking coefficients in Q,R, or C. For n > 1, the latter groups fail to be 
commutative. 


? 'N is the set of natural numbers including 0. 
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(3) N equipped with the usual addition, N, Z with the usual multiplication, 
are commutative monoids, but fail to be groups. 


(4) For a set X, let S(X) be the set of bijective maps X —> X. Then 
the composition of maps makes S(X) a group. This group is not abelian if 
X consists of at least three elements. In the special case X¥ = {1,...,n}, we 
put G,, := S(X) and call it the symmetric group of degree n or the group of 
permutations of the integers 1,...,n. Quite often a permutation 7 € G,, is 
described explicitly in the form 


Can a 


where 7(1),...,7(m) are the images under 7. By counting all ordered combina- 
tions of 1,...,n, we see that G,, consists of precisely n! elements. 


(5) Let X be a set and G a group. We write G* := Map(X,G) for the 
set of all maps X —+ G; it is canonically a group. Indeed, given f,g € G*, 
the product fg is defined by (f+ g)(x) := f(x)- g(x) for x varying over X. 
Thus, f +g is obtained by multiplying values of f and g with respect to the 
law of composition on G. We call G* the group of G-valued functions on X. 
In the same way, the group G) can be considered; it consists of all maps 
f: X — G satisfying f(#) = 1 for almost all x € X (ie., for all « € X, up 
to finitely many exceptions). The groups G* and G™) are commutative if G is 
commutative. Furthermore, G* coincides with G™) if X consists of only finitely 
many elements. 


(6) Let X be an index set and (G,)cx a family of groups. The set-theoretic 
product [<x G_, becomes a group if we define the composition of two elements 
(Gx)eex, (he)eex © [cx Ge componentwise via 


(9x)zex . (hz) wex = (Gx : Ny) vex: 


The group [],<¢x Gz is called the direct product of the groups Gz, x € X. 
In the special case X = {1,...,n}, the direct product is usually denoted by 
G,x...x G,. If all groups G, are copies of one and the same group G, then we 
have [cx Ge = G*, using the notation of the preceding example. In addition, 
if X is finite, say X = {1,...,n}, one writes G” instead of G* or G®). 


Definition 3. Let G be a monoid. A subset H C G is called a submonoid if H 
satisfies the following conditions: 

(i) ee H, 

(ii) abe H= abe H. 
If G is a group, H is called a subgroup of G if in addition, 

(iii) ae Hs ated. 
In particular, a subgroup of a group G is a submonoid that is closed under the 
process of taking inverses. 
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In defining a subgroup H C G, condition (i) can be weakened by simply 
requiring H # @), since this implies e € H using (ii) and (iii). Of course, similar 
reasoning is not possible for monoids. Every group G admits {e} and G as trivial 
subgroups. Given m € Z, the set mZ consisting of all integral multiples of m is 
a subgroup of the additive group Z. We will see in 1.3/4 that all subgroups of 
Z are of this type. More generally, every element a of a group G gives rise to a 
so-called cyclic subgroup of G. It consists of all powers a”, n € Z, where we put 
a” = (a~')~” for n < 0; see also Section 1.3. 


Definition 4. Let G, G’ be monoids with corresponding unit elements e and e'. 
A monoid homomorphism y: G —> G’ is a map y from G to G" such that 

(i) ple) =e, 

(ii) p(ab) = y(a)p(b) for all a,b € G. 
Furthermore, if G, G'’ are groups, yp is called a group homomorphism. 


Remark 5. A map y: G —> G" between groups is a group homomorphism if 
and only if y(ab) = y(a)y(b) for all a,be G. 


Proof. We conclude y(e) = e’ from y(e) = y(ee) = v(e)y(e). 


Remark 6. Let py: G — G’ be a group homomorphism. Then inverse elements 
satisfy p(a-') = (y(a))“1 for allaeG. 


Proof. e' = p(e) = y(aa~") = y(a)p(a-"). 


A group homomorphism y: G —> G’ is called an isomorphism if yp ad- 
mits an inverse, i.e., if there exists a group homomorphism w: G’ —> G such 
that wo y = idg and yow = ide, for idg and idg the identity maps on 
G and G’. Note that a group homomorphism is an isomorphism if and only 
if it is bijective. Injective (resp. surjective) group homomorphisms G —> G’ 
are called monomorphisms (resp. epimorphisms). An endomorphism of G is a 
homomorphism G —> G, an automorphism of G is an isomorphism G —> G. 

Let y: G —> G’ and w: G’ —> G" be group homomorphisms. Then the 
composition yoy: G —> G"” is a group homomorphism again. Moreover, given 
a group homomorphism y: G —> G’, we can consider the subgroups 


ker p = {9 €G; y(g) =1} CG (kernel of y) 
as well as 
imy = (G) c G (image of vp). 


Note that ¢ is injective if and only if ker p = {1}. We continue by listing some 
examples of homomorphisms. 


(1) Let G be a monoid. Fixing an element x € G’, the map 


nm 


yp: N—>G, nt 2”, 


14 1. Elementary Group Theory 


defines a monoid homomorphism when N is considered a monoid with respect 
to the usual addition. If G is a group, we obtain in the same way a group 
homomorphism 

yp: Z4—>G, ne > ox", 


where we put 2” := (a ')~" for n < 0. On the other hand, it is clear that 
each monoid homomorphism y: N —> G, resp. each group homomorphism 
yp: Z — G, must be of this type; just put « = y(1). 


(2) Let G be a group and S(G) the corresponding group of all bijective 
maps from G to itself. For a € G, let t € S(G) be the left translation by a on 
G, i.e., the map 

Tj: GG, gt ag. 


Then 
G — S(G) 


defines an injective group homomorphism, and we can identify G with its image 
in S(G), thereby interpreting G as a subgroup of $(G). In particular, every 
group consisting of n elements can be viewed as a subgroup of the symmetric 
group G,, a result generally known as Cayley’s theorem. 

Similarly as before, we can define right translations on G. Also these can be 
used to construct an injective group homomorphism G —>+ S(G); see Exercise 4 
below. 


(3) Let G be an abelian group, and fix n € N. Then 


at Ta, 


5] 


G—G, grog", 


is a group homomorphism. 


(4) Let G be a group, and fix a € G. Then 
pa: G —> G, gt aga", 


is a so-called inner automorphism of G. The set Aut(G) of all automorphisms 
of G is a group under the composition of automorphisms in terms of maps, and 
the map G —> Aut(G), a+ ga, is a group homomorphism. 


(5) The exponential function defines a group isomorphism 
R — Ryo, xt exp(z). 


Of course, to verify this we must use the properties of the exponential function 
known from analysis, notably the functional equation 


exp(x + y) = exp(x) - exp(y). 


Exercises 


1. Give a proof of Remark 2. 


1.2 Cosets, Normal Subgroups, Factor Groups 15 


2. The exponential function gives rise to an isomorphism between the additive group 
R and the multiplicative group Ryo. Check whether there can exist a similar iso- 
morphism between the additive group Q and the multiplicative group Qso. 


3. For a monoid G, consider the following conditions: 
(i) G is a group. 
(ii) For a,x, y € G, each of the equations ax = ay and ra = ya implies x = y. 
Then (i) => (ii). Show that the reverse implication holds for finite monoids G, 


but not for arbitrary monoids G. 


4. Let G bea group. Analogously to the notation of left translations, introduce right 
translations on G and use them to construct an injective group homomorphism 
G— S(G). 

5. Let X be a set and consider a subset Y C X. Show that the group S(Y) can be 
viewed canonically as a subgroup of S(X). 


6. Let G be a finite abelian group. Show that Il g =1. 
geEG 


7. Let G be a group such that a? = 1 for all a € G. Show that G is abelian. 


8. Consider a group G together with subgroups Hy, H2 C G. Show that Hy U Ho is 
a subgroup of G if and only if H, C H2 or H2 C Hy, holds. 


1.2 Cosets, Normal Subgroups, Factor Groups 


Let G be a group and H C G a subgroup. A left coset of H in G is a subset of 
G of type 
aH := {ah; he H}, 


where a€ G. 


Proposition 1. Any two left cosets of H in G have the same cardinality® and 
are disjoint if they do not coincide. In particular, G is the disjoint union of all 
left cosets of H in G. 


Proof. For each a € G, the left translation H —> aH, h -~ ah, is bijec- 
tive. Therefore, all left cosets of H in G have the same cardinality. The second 
assertion is a consequence of the following lemma: 


Lemma 2. Let aH and bH be left cosets of H in G. Then the following con- 
ditions are equivalent: 

(i) aH = bH. 
(ii) aH NbH FO. 
(iii) a € bH. 
(iv) ta € H. 


3 Two sets X,Y are said to have the same cardinality if there exists a bijection X —> Y. 
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Proof. The implication (i) == (ii) is trivial, since H 4 @. Next, assume (ii). 
There exists an element c € aH N bH, say c = ah, = bho, where hy, ho € H. 
This means that a = bhgh;' € bH, and we see that (iii) holds. Multiplication 
by b- and b shows that (iii) is equivalent to (iv). Finally, assuming (iv), we get 
a € bH and hence aH C bH. On the other hand, the inverse of b-'a € H must 
be contained in H as well: thus ab € H. Similarly as before we conclude that 
bH C aH and therefore aH = bH. 


All elements of a left coset aH are called representatives of this coset. In 
particular, a is a representative of aH, and we see from Lemma 2 that a’H = aH 
for every representative a’ € aH. The set of left cosets of H in G is denoted by 
G/H. Analogously one defines the set H\G of all right cosets of H in G, ice., 
of all subsets of type 

Ha={ha;he H}, 


where a € G. It is easily checked that the bijection 
G—G, grog, 


maps a left coset aH bijectively onto the right coset Ha~' and thereby defines 
a bijective map 
G/H — H\G, aH ++ Hat. 


In particular, Proposition 1 and Lemma 2 (with the obvious modifications in 
Lemma 2) are valid for right cosets as well. The number of elements in G/H, 
resp. H\G, is called the index (G: H) of H in G. Writing ord G for the number 
of elements of a group G and calling it the order of G, we can conclude from 
Proposition 1 the following corollary: 


Corollary 3 (Theorem of Lagrange). Let G be a finite group and H a subgroup 
of G. Then 
ordG = ord H -(G: #). 


Definition 4. A subgroup H C G is called a normal subgroup of G if aH = Ha 
for all a € G, i.e., if for each element a € G the associated left and right cosets 
of H in G coincide. If such is the case, the coset aH = Ha given by a is referred 
to as the residue class of a modulo H. 


The condition aH = Ha can be rewritten as aHa~! = H. Note that a 
subgroup H C G is normal as soon as we have aHa! C H for alla € G 
(alternatively: H C aHa™' for all a € G). Indeed, aHa™! C H is equivalent 
to aH C Ha, and likewise, a'Ha C H to Ha C aH. Moreover, observe that 
every subgroup of a commutative group is normal. 


Remark 5. The kernel of any group homomorphism yp: G — G' is a normal 
subgroup in G. 
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Proof. ker y is a subgroup of G,, and we get a: (ker y)-a~! C ker for alla € G 
from 1.1/6. 


Now, starting with a normal subgroup N C G, we want to look at the 
reverse problem of constructing a group homomorphism y: G —> G’ whose 
kernel coincides with N. To do this we introduce a suitable group structure on 
the set of residue classes G/N and define vy as the projection 7: G —> G/N, 
assigning to an element a € G the corresponding residue class aN. As a technical 
tool we define the product of two subsets X,Y C G by 


X-Y: ={#-yeG;,rcex, yer}. 


Then, using the fact that N is normal in G, we can write for a,b € G, 
(aN) - (DN) = {a} - (Nb). N = {a} - (ON) -N = {ab} - (NN) = (ab) N. 


As we see, the product of the cosets aN and bN with representatives a and b is 
a coset again, namely the one (ab)N with representative ab. Now, considering 
this product as a law of composition “.” on G/N, we conclude immediately from 
the properties of G being a group that G/N is a group again; N = 1N is the 
unit element in G/N, and a“!N is the inverse of aN € G/N. Furthermore, it 
is clear that the map 


tT: G—+G/N, at— aN, 


the canonical projection from G to G/N, is a surjective group homomorphism 
satisfying ker = N. The group G/N is called the factor group or the residue 
class group of G modulo N. 

For many applications it is important to know that the group homomor- 
phism 7: G —+ G/N satisfies a so-called universal property that characterizes 
G/N up to canonical isomorphism: 


Proposition 6 (Fundamental theorem on homomorphisms). Let y: G — G’ 
be a group homomorphism and N C G a normal subgroup such that N C ker y. 
Then there exists a unique group homomorphism GB: G/N —>+ G’ satisfying 
p=Go7, i.e., such that the diagram 


is commutative. Furthermore, 
im? =imy, ker @ = (ker vy), ker p = 1 ‘(ker 9), 


and it follows that P ws injective if and only if N = ker y. 
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Proof. If D exists, then 


P(aN) = (n(a)) = o(a) 


for a € G and we see that @ is unique. On the other hand, we can try to set 
G(aN) = y(a) when defining G. However, for this to work well, it is necessary 
to know that y(a) is independent of the choice of the representative a € aN. 
To justify this, assume aN = bN for two elements a,b € G. Then we have 
b-'a € N Ckery and thus ¢(b~'a) = 1, which yields y(a) = y(b). That Gis a 
group homomorphism follows from the definition of the group law on G/N, or in 
other words, from the fact that 7 is an epimorphism. This settles the existence 
of GB. 

Finally, the equation ker p = 1~'(ker G) follows from the fact that y is the 
composition of @ and 7. Moreover, we can conclude imG = imy as well as 
ker @ = z(ker y) from the surjectivity of 7. 


Corollary 7. If y: G — G' is a surjective group homomorphism, then G" is 
canonically isomorphic to G/ ker vy. 


As an application of Proposition 6, we want to prove the so-called isomor- 
phism theorems for groups. 


Proposition 8 (First isomorphism theorem). Let G be a group, H C Ga 
subgroup, and N C G a normal subgroup of G. Then HN is a subgroup of G 
admitting N as a normal subgroup, and HAN is a normal subgroup of H. The 
canonical homomorphism 


H/HAN — HN/N 


is an isomorphism. 


Proof. Using the fact that N is normal in G, one easily shows that HN is a 
subgroup of G. Furthermore, N is normal in HN, since it is normal in G. Now 
consider the composition of homomorphisms 


Ho HN > HN/N, 


where 7 is the canonical projection. It is surjective and has HM N as its kernel. 
Therefore, HM N is a normal subgroup in A, and the induced homomorphism 


H/HAN — HN/N 


is an isomorphism, due to Proposition 6 or Corollary 7. 


Proposition 9 (Second isomorphism theorem). Let G be a group and let N, H 
be normal subgroups of G satisfying N C H CG. Then N is normal in H as 
well, and one can view H/N as a normal subgroup of G/N. Furthermore, the 
canonical group homomorphism 


1.2 Cosets, Normal Subgroups, Factor Groups 19 


(G/N) /(H/N) — G/H 
is an isomorphism. 


Proof. To begin with, let us explain how to view H/N as a subgroup of G/N. 
Look at the group homomorphism 


Ps C/N, 


where a is the canonical projection. Since this homomorphism admits N as 
kernel, it induces by Proposition 6 a monomorphism H/N <= G/N. Thus, we 
can identify H/N with its image in G/N. 

Next observe that the kernel of the canonical projection G —+ G/H, which 
is H, contains N as a normal subgroup. Therefore, using Proposition 6, the 
projection G —> G/H induces an epimorphism G/N —> G/H whose kernel 
is normal in G/N and coincides with the image of H under the projection 
G —+ G/N; the latter image was identified with H/N before. Now, applying 
Proposition 6 or Corollary 7 again, we see that G/N —> G/H gives rise to an 
isomorphism 


(G/N) /(H/N) - G/H. 


Exercises 


1. Let G be a group and H a subgroup of index 2. Show that H is normal in G. Is 
the same assertion true in the case that H is of index 3 in G? 


2. Let G be a group and N C G anormal subgroup. Give an alternative construction 
of the factor group G/N. Proceed as follows: Consider the set X = G/N of all left 
cosets of N in G and show that there is a group homomorphism y~: G —> S(X) 
such that ker p = N. 


3. Let X be a set, Y C X a subset, G a group, and G* the group of G-valued 
functions on X. Let N := {f € G*; f(y) = 1 for all y € Y}. Show that N isa 
normal subgroup of G* satisfying G* /N ~ GY. 

4. Let y: G — G'" be a group homomorphism. Show: 

(i) If H C Gis a subgroup, then y(/) is a subgroup in G’. The corresponding 
assertion for normal subgroups is valid only if y is surjective. 

(ii) If H’ c G" is a subgroup (resp. normal subgroup) in G’, the same is true 
for p1(H’) CG. 

5. Let G be a finite group, and let Hi, Hz C G be subgroups satisfying Hi C Ho. 
Show that (G: H,) = (G: Ha) - (Hy: Mj). 

6. Let G be a group and N C G a normal subgroup satisfying the following maxi- 
mality condition: If H ¢ G is a proper subgroup containing N, then it coincides 
with N. Show for all subgroups Hy, H2 C G satisfying H; # {1} # Ho and 
AH, ON = H20N = {1} that Hy is isomorphic to Ho. 
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For a group G and asubset X C G, define H as the intersection of all subgroups 
of G containing X. Then H is a subgroup of G, in fact the (unique) smallest 
subgroup of G containing X. We say that H is generated by X or, if H coincides 
with G, that G is generated by X. The subgroup H C G can be described in 
more explicit terms. It consists of all elements 


where 21,...,U, € X and €1,...,€, € {1,-1}, with n varying over N. Clearly, 
the elements of this type form the smallest subgroup of G containing X and 
thus by definition constitute the subgroup H CG. 

For the moment we are interested only in the case in which X consists of 
a single element « € G. The subgroup generated by x in G is denoted by (2), 
and its description simplifies to the following: 


Remark 1. Let x be an element of a group G. Then the subgroup (x) C G 
generated by x in G consists of all powers x", n € Z. In other words, (x) 
coincides with the image of the group homomorphism 


Z— G, nre> ax”, 


where Z means the additive group of all integers. In particular, (x) is commu- 
tative. 


Definition 2. A group G is called cyclic if it is generated by a single element. 
This is equivalent to the fact that there exists a surjective group homomorphism 
Z— G. 


Observe that for a commutative group G with additively written law of 
composition, the map Z —>+ G from Remark 1 is given by n +> n- x, where 
n-x is to be interpreted as the n-fold sum of x for n > 0 and as the (—n)-fold sum 
of —x for n < 0. In particular, the additive group Z is generated by the element 
1 € Zand therefore is cyclic. It is called the free cyclic group; its order is infinite. 
On the other hand, given m € Z, the subgroup mZ of all integral multiples of 
m is cyclic, since it is generated by m = m-1. The factor group Z/mZ is cyclic 
as well, generated by the residue class 1+mZ. If m 4 0, say m > 0, then Z/mZ 
is called the cyclic group of order m. Indeed, Z/mZ, where m > 0, consists of 
precisely m elements, namely the residue classes 0 + mZ,...,(m— 1) + mZ. 
In the following we want to show that Z and the groups of type Z/mZ are 
the only cyclic groups, up to isomorphism. Due to the fundamental theorem on 
homomorphisms (in the version of 1.2/7) we see that a group G is cyclic if and 
only if there exists an isomorphism Z/H —> G, for H a (normal) subgroup of 
Z. Therefore, in order to determine all cyclic groups it is enough to determine 
all subgroups of Z. 
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Proposition 3. Let G be a cyclic group. Then: 


Z, if ord G = ov, 
~ |Z/mZ, if ord G =m < 0. 
In particular, the groups Z and Z/mZ for integers m > 0 are the only cyclic 
groups, up to isomorphism. 


As we have seen before, to prove the proposition it is enough to establish 
the following lemma: 


Lemma 4. Let H C Z be a subgroup. Then there exists an integer m € Z such 
that H = mdZ. In particular, every subgroup of Z is cyclic. 


Proof. We may assume H ¥ 0, i.e., that H is different from the zero subgroup 
of Z given by the zero element. Then H must contain positive integers; let m 
be the smallest among these. We claim that H = mZ, where clearly, mZ C H. 
To show the reverse inclusion, let a € H. Using Euclidean division of a by m, 
there are integers q,r € Z,0<r<™m, such that a= qm+r. Then r=a—qm 
belongs to H. However, since all positive integers in H are greater than or equal 
to m, we must have r = 0. Thus, a = gm € mZ and therefore H C mZ. All in 
all, we get H = mZ. 


Proposition 5. (i) Every subgroup H of a cyclic group G is itself cyclic. 
(ii) If p: G — G' is a group homomorphism, where G is cyclic, then ker yp 
and im are cyclic. 


Proof. It follows immediately from the definition of cyclic groups that the image 
of a cyclic group under a group homomorphism y: G —> G’ is cyclic. Since 
ker y is a subgroup of G, it remains to verify assertion (i). Therefore, let G 
be cyclic and let H C G be a subgroup. Furthermore, let 7: Z —> G be an 
epimorphism. Then 7~!(#) is a subgroup of Z and therefore cyclic by Lemma 4. 
But then H is cyclic, since it is the image of 7~'(H) with respect to 7, and 
assertion (i) follows. 


Let G be a group. The order orda of an element a € G is defined as the 
order of the cyclic group generated by a in G. As we know already, y: Z — G, 
n + a”, yields an epimorphism from Z onto the cyclic subgroup H Cc G 
that is generated by a. If ker y = mZ and G is finite, then necessarily m 4 0, 
say m > 0, and H is isomorphic to Z/mZ. Thus, m is the smallest positive 
integer satisfying a” = 1, and we see that H consists of the (distinct) elements 
1 =a°,a!,...,a”~!. In particular, orda = m. 

Proposition 6 (Fermat’s little theorem). Let G be a finite group, a € G. Then 
orda divides ordG and we have a%?% = 1. 
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Proof. Apply the theorem of Lagrange 1.2/3 to the cyclic subgroup of G that is 


generated by a. 


Corollary 7. Let G be a finite group such that p := ordG is prime. Then G is 
cyclic, G ~ Z/pZ, and every element a € G, a #1, is of order p. In particular, 
every such element a generates G. 


Proof. For each element a € G, a 4 1, consider the cyclic subgroup H CG 
generated by a. Then ord a = ord H is different from 1 and, according to Propo- 
sition 6, a divisor of p = ordG. Since p is prime, we get orda = ord H = p. 
Therefore, H = G, i.e., G is generated by a and hence is cyclic. Furthermore, 


G is isomorphic to Z/pZ, due to Proposition 3. 


Exercises 


1. 


For m € N—{0} consider the set Gm := {0,1,...,m— 1} and define a law of 
composition on it via 


aob:= the remainder of a+ b with respect to division by m. 


Give a direct argument showing that “o” constitutes a group law on Gm and that 
the resulting group is isomorphic to Z/mZ. 


Determine all subgroups of Z/mZ for m € N—{O}. 
Consider Z as an additive subgroup of Q and show: 
(i) Every element in Q/Z is of finite order. 
(ii) The factor group Q/Z admits for each n € N — {0} precisely one subgroup 
of order n, and it is cyclic. 


Let m,n € N — {0}. Show that the groups Z/mnZ and Z/mZ x Z/nZ are iso- 
morphic if and only if m and n are relatively prime. In particular, a product of 
two finite cyclic groups whose orders are relatively prime is itself cyclic. 


Let y: Z” —+ Z” be an endomorphism of the n-fold product of the additive 
group Z, where n € N. Show that y is injective if and only if Z"/im ¢ is finite. 
Hint: Consider the homomorphism of Q-vector spaces Yo: Q” —+ Q” attached 
to . 
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Background and Overview 


A ring is an abelian additive group R that is equipped with an additional mul- 
tiplication, just like the ring Z of integers. More specifically, it is required that 
R bea monoid with respect to the multiplication and that the multiplication be 
distributive over the addition. We will always assume that the multiplication of 
a ring is commutative, except for a few occasions in Section 2.1. If the nonzero 
elements of a ring form an (abelian) group under the multiplication, the ring is 
actually a field. In principle, the definition of a ring goes back to R. Dedekind. 
For Dedekind, rings were motivated by questions in number theory involving 
intergral elements in algebraic number fields, or in other words, by the study of 
algebraic equations with integer coefficients. However, we will deal with rings 
of integral algebraic numbers only occasionally. More important for us are fields 
serving as coefficient domains for algebraic equations, as well as polynomial 
rings over fields. These are of fundamental importance in studying algebraic 
equations, and in particular algebraic field extensions. In the following, let us 
have a first look at polynomials. 
If we want to solve an algebraic equation 


(x) ota} +... +a, =0, 


say with coefficients a,,...,d@, in a field kK, we can try to view the symbol 
x as a quantity that is “variable.” More precisely, we consider the expression 
f(x) =a" +a,2""!+...+ 4p as a function assigning to an element x the value 
given by f(x). Then, of course, we have to determine the zeros of the function 
f(a). On the other hand, to be strict, we must fix the domain where x may vary, 
for example K itself or, if AK = Q, also the real or complex numbers. We say 
that f(x) is a polynomial function in x or, by abuse of language, a polynomial 
in &. 

However, finding out about a suitable domain of definition that is big enough 
to contain “all” zeros of f is a basic problem. From a historical point of view, the 
fundamental theorem of algebra is a good device to settle this point. It implies 
for every subfield & C C that all solutions of (*) that can appear in extension 
fields over K may be viewed as complex numbers. Therefore it is appropriate 
to interpret f(a) in this case as a polynomial function on C. Problems of a 
different kind arise when one is considering algebraic equations with coefficients 
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from a finite field F; cf. 2.3/6 or Section 3.8 for the definition of such fields. For 
example, if F consists of the elements 71,...,2,, then 


g(x) = [[@ —2;)=a9+...4+(-1)!a1...2,4 


j= 


is a polynomial function that vanishes on all of F, although not all of its “coef- 
ficients” are zero. Thereby we see, depending on the domain of definition, that 
it is not always possible to reconstruct the coefficients of the equation (*) if the 
attached polynomial function f(x) is known as a map only. 

To avoid such difficulties one refrains from the idea that a polynomial might 
be a function on a certain domain of definition. Instead one tries to implement 
the following two aspects. The first is that there should be a one-to-one corre- 
spondence between polynomials and their “coefficient” sequences. On the other 
hand, one likes to retain the possibility of relating polynomials to functions, in 
such a way that polynomials can be evaluated at elements of certain fields (or 
rings) extending the given domain of coefficients. To achieve this we define a 
polynomial with coefficients ao,..., dn as a formal sum f = })_, a;X7, where, 
in down-to-earth terms, this just means that f is identified with the sequence 
of its coefficients ao,...,@n. If the coefficient domain is a field (or a ring), we 
can add and multiply polynomials in the usual way by applying the conven- 
tional rules formally. In this way, all polynomials with coefficients in a field A’ 
form a ring K[X]. Also note that we can evaluate such polynomials f € K[X] 
at elements x belonging to arbitrary extension fields (or rings) K’ D K; just 
substitute the variable X by x and consider the resulting expression f(a) as 
an element of K’. In particular, we can talk about zeros of f in K’. We will 
study this formalism more closely for polynomials in one variable in 2.1 and for 
polynomials in several variables in 2.5. 

Now the problem of solving algebraic equations with coefficients in a field 
can be phrased in a slightly more precise way as follows: determine the zeros in 
suitable extension fields A’ containing AK for monic polynomials with coefficients 
in K, ie., for polynomials of type f = X"+a,X""!+...+a, € K[X]. There 
is one reduction step that should be applied if possible. If the polynomial f can 
be written as the product of two polynomials g,h € K[X], ic., f = gh, then 
to specify the zeros of f it is enough to specify the zeros of g and h separately. 
The reason is that we have f(x) = (gh)(x) = g(x)h(x) for « € K’, as is verified 
without difficulty. Since the latter equation has to be read in a field, we see 
that f vanishes at x precisely when g or h vanishes at this point. Therefore, to 
simplify the problem, one should try to reduce the algebraic equation f(x) = 0 
to equations of lower degree, by factoring f in K[X] into a product of monic 
polynomials of lower degree. If that is impossible, then f as well as the algebraic 
equation f(x) = 0 are said to be irreducible. 

In particular, the preceding considerations show that factorizations of poly- 
nomials should be studied. We will do this in 2.4. Starting out from the fact 
that there is a so-called Euclidean division in polynomial rings over fields, i.e., a 
division process with remainder, we will show that the theorem of unique prime 
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factorization is valid in K[X], just as it is in the ring Z of integers. Hence, 
we conclude that every monic polynomial admits a unique factorization into 
irreducible monic polynomials. Further considerations in 2.7 and 2.8 will deal 
with criteria for irreducibility and thereby with the question of how to decide 
whether a given polynomial f € K [|X] is irreducible. 


There is another reason why prime factorizations in polynomial rings K LX | 
are of special interest. To explain this in more detail, let us briefly touch upon 
the notion of ideals, a concept that belongs to the basics of ring theory; it will 
be dealt with in 2.2. An ideal a of a ring R is an additive subgroup of R such 
that ra € a for all r € R and all a € a. In many respects ideals behave like 
normal subgroups of groups. For example, we can construct the residue class 
ring R/a of a ring R by an ideal a C R, prove the fundamental theorem on 
homomorphisms, and so on; cf. 2.3. Ideals appeared in mathematics at the end 
of the nineteenth century, alongside attempts to extend the theorem on unique 
prime factorization from the ring of integers Z to more general rings of algebraic 
integers. When it was realized that this was impossible in the general case, one 
started looking at factorizations into so-called ideal numbers. However, finally it 
was Dedekind who observed that instead of factorizations of single elements, one 
should rather concentrate on factorizations of certain subsets, which he called 
ideals, of the given ring. In this way, Dedekind proved in 1894 the theorem on 
the unique prime factorization for ideals in rings of algebraic integers. Today an 
integral domain, i.e., a nonzero ring without nontrivial zero divisors, is called a 


Dedekind domain if Dedekind’s theorem holds for it. 


For us it is important to know that the polynomial ring K [|X] over a field 
is a principal ideal domain; this means that it is an integral domain and that ev- 
ery ideal a C K[X] is of type (f), i-e., generated by a single element f € K[X]. 
This result will be established in 2.4/3. Furthermore, we show that the theorem 
on unique prime factorization is valid in principal ideal domains. Investigations 
of this kind lead directly to the so-called construction of Kronecker, which will 
be discussed at length in 3.4/1. Given an irreducible algebraic equation f(a) = 0 
with coefficients in a field kK, the construction allows one to specify in a simple 
way an extension field A’ that contains a solution. Indeed, set K’ = K[X]/(f), 
check that it is a field naturally extending A’, and observe that the residue class 
X of X € K[X] solves the equation. Even if the construction does not provide 
any closer details on the structure of the field K’, for example on the solvability 
by radicals, it nevertheless gives a valuable contribution to the question of the 
existence of solutions. 


To illustrate the potential of principal ideal domains, we present at the end 
of the chapter in 2.9 the theory of elementary divisors, a topic that actually 
belongs to the domain of linear algebra. As a generalization of vector spaces 
over fields, we study “vector spaces” or, as one prefers to say, modules over 
rings and, in particular, over principal ideal domains. 
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2.1 Polynomial Rings in One Variable 


Definition 1. A ring (admitting a unit element) is a set R together with two 
(inner) laws of composition written as addition “+” and multiplication “:” such 
that the following conditions are satisfied: 
(i) Ris a commutative group with respect to addition. 
(ii) R is a monoid with respect to multiplication, i.e., the multiplication is 
associative, and there exists a unit element in R. 
(iii) The distributive laws hold, i.e., 


(a+b)-c=a-c+b-c, c:(a+b)=c-a+c-b, fora,b,cE R. 
R is called commutative if the multiplication is commutative.' 


On the right-hand sides of the distributive laws in (iii) we have refrained 
from introducing a special bracketing. Just as for computations with ordinary 
numbers, it is common that multiplication is granted a higher precedence than 
addition. For every ring, the zero element of the addition will be denoted by 
0, the unit element of the multiplication by 1. Note that the case 1 = 0 is 
not excluded; it characterizes the so-called zero ring, which consists of a single 
element 0. If no confusion is possible, the zero ring is denoted by 0 as well. For 
calculations in rings, one may use essentially the same rules as for calculations 
in terms of ordinary numbers, e.g., 


0-a=0=a-0, (—a)-b (ab) = a- (—d), for a,be R. 


However, note that from ab = ac, resp. a: (b—c) = 0 (even for a # 0), we 
cannot necessarily conclude b = c. The latter equality can be obtained only 
when one is dealing with so-called integral domains (see below) or in the case 
that a admits an inverse element with respect to multiplication. Thus, caution 
is required when applying cancellation rules. 

Let R be a ring. A subset SC R is called a subring of R if S is a subgroup 
with respect to the addition on R and a submonoid with respect to the multi- 
plication on R. In particular, using the laws of composition inherited from R, 
it is clear that S is a ring again. The pair S C R is called a ring extension. 

Given a ring R, we write 


R* = {a € R; there exists b € R such that ab = ba = 1} 


for the set of multiplicatively invertible elements; these are referred to as the 
units of R. It is easily checked that R* is a group with respect to multiplication. 
R is called a division ring or a skew field if R A 0 and R* = R— {0}, ie., if 
1 40 and each nonzero element of R is a unit. In addition, if the multiplication 


| Although we will refer to a few notions and examples of noncommutative rings in this 
section, we will generally assume that all rings are commutative, unless stated otherwise. 
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of R is commutative, R is called a field. An element a of a ring R is called a 
zero divisor if there exists an element b € R — {0} such that ab = 0 or ba = 0. 
Fields and skew fields do not admit any zero divisors, except for 0, the trivial 
zero divisor. Finally, a commutative ring R is called an integral domain if it is 
nonzero and does not admit nontrivial zero divisors. We give some examples of 
rings. 

(1) Z is an integral domain whose group of units consists of the elements 1 
and —1. 


(2) Q, R, C form fields, the Hamiltonian quaternions H a skew field. For 
completeness, let us recall the construction of H. Start with a 4-dimensional 
R-vector space V, say with a basis e,7, 7, k. Set 


e=e, ei=ie=i, ej=je=j, ek=ke=k, 
P=P=k = -e, 


ij=—ji=k, jk=—-kj=i, ki=—ik =j, 


and define the product of arbitrary elements in V by R-linear extension. The 
resulting multiplication, together with the vector space addition, makes V a 
(noncommutative) ring H, even a skew field with e as unit element. Identifying 
the field R of real numbers with Re, we can view R as a subfield of H, i.e., as a 
subring that is a field. In a similar way, we can interpret C as a subfield of H. 


(3) Let AK be a field. Then R = K"*”", the set of all (n x n) matrices 
with coefficients in kK defines a ring together with the ordinary addition and 
multiplication of matrices; its group of units is 


R* ={Ae k™"; det AF 0}. 


Note that R is noncommutative for n > 2 and that in this case, R admits non- 
trivial zero divisors. More generally, we can state that the set of endomorphisms 
of a vector space V (or, alternatively, of an abelian group G) is a ring. Here the 
addition of endomorphisms is defined via the inherent addition on V resp. G, 
and the multiplication as composition of endomorphisms. 


(4) Let X be a set and R aring. Then R*, the set of R-valued functions 
on X, becomes a ring if we set for f,g € R*, 


ft+g:X SR, xr > f(x) +g(2), 

f-g:X SR, ur f(x)- g(x). 
In particular, if X = {1,...,n} C N, we can view R* as the n-fold Cartesian 
product R= Rx... x R, the ring structure on R” being defined by 
(@1,---,;%n) + (Yi,---5 Yn) = (41 + Y1,---, Ln + Yn); 
(Sis esa) Ysa) = Bre Yiysseo Bat Ya) 


(*) 


The zero and the unit elements are given by 0 = (0,...,0) and 1 = (1,...,1). 
Furthermore, the equation (1,0,...,0)-(0,1,...,1) = 0 shows for n > 2 that 
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R" will generally admit nontrivial zero divisors, even if R itself is an integral 
domain. We call R” the n-fold ring-theoretic product of R with itself. More 
generally, we can consider the ring-theoretic product 


P=|[2, 


rEX 


of a family of rings (R,)zex. Addition and multiplication on P are defined 
componentwise, just as in the formulas (*). If the rings R, are copies of one and 
the same ring R, then ]],,.. R, and R* coincide naturally. 


From now on we will restrict ourselves to commutative rings. Thus, unless 
stated otherwise, the term ring will always be used in the sense of a commutative 
ring. Starting out from such a ring R, we want to construct a ring extension 
R[X], the so-called polynomial ring in a variable X and with coefficients in R. 
In terms of sets, we let RIX] := R%), where as usual, R®) stands for the set 
of all maps f : N —> R satisfying f(i) = 0 for almost all i € N. Identifying a 
map f : N —>+ R with its corresponding sequence (f(7));en of images in R, we 
can write 


RS) = { (ai )ien ; a; € R, a; =0 for almost all i € N}. 


Now, in order to introduce a ring structure on R™), define the addition compo- 
nentwise, i.e., by 
(aj) + (0;) = (a; + Bi). 


Note that in terms of maps N —> R, this corresponds to the usual addition as 
considered in example (4) above. Concerning the multiplication, we proceed dif- 
ferently and use a construct that is inspired by the multiplication of polynomial 
functions. In fact, we set 


where 


It can readily be checked that R“) becomes a ring; the zero element is given by 
the sequence (0,0,0,...) and the unit element by the sequence (1,0,0,...). One 
writes R[X] for the ring thus constructed, calling it the ring of polynomials in 
one variable X over R. The role of the “variable” X becomes more plausible if 
we write elements (a;) € R[X] in the familiar polynomial form 


n 
y a,X* or y a, X', 
i=0 


iEeN 


where n is large enough that a; = 0 for i > n and where X is given by the se- 
quence (0,1,0,0,...). In terms of polynomial sums, addition and multiplication 
in R[X] are described by the familiar formulas 
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So aiX'+ So b.X! = SO (ai + bi) X", 
SaiX?. $7 b.X? = y( S> a, by) X°. 


a p+v=t 


Finally, to interpret R as a subring of R[X], we view the elements of R as 
constant polynomials in R[X], i.e., we use the map R > R[X], a+ aX°, as 
an identification. This is permitted, since the injection respects ring structures 
on R and R[LX] and thus is a homomorphism of rings, as we will say. 

To explain more closely the significance of the “variable” X, consider a 
ring extension R C R’ and a polynomial f = 5° a;X' in R[X]. Then we can 
substitute the variable X by any element 2 € R’ and thereby compute the 
value f(x) = S>a;x" of f at x. In particular, f gives rise to a well-defined map 
R' — R’, x+> f(x), where 


(f+ 9)(2) = f(a) +g(@), (fF -9)(@) = f(x) - 9(@) 


for f,g € RLX]. Notice that in order to establish the multiplicativity of the 
right-hand equation, we need the commutativity of the multiplication on R’ 
or, what is enough, the permutability relation ax = xa for a € R, x € R’. In 
particular, in doing computations in the polynomial ring R[X], the “variable” 
X behaves like a universally variable quantity with the special property that 
equations in RX] are preserved when X is substituted by elements in R’. 

Given a polynomial f = >> a,;X' € R[X], its ith coefficient a; is called its 
coefficient of degree i. Moreover, the degree of f is defined by 


deg f := max{i; a; 4 0}, 


where we assign the degree —oo to the zero polynomial 0. If deg f = n > 0, then 
dy, is referred to as the highest or the leading coefficient of f. If it is 1, we say 
that f is monic. Each polynomial f € R[X]—{0} whose leading coefficient an, 
is a unit can be transformed into a monic one by multiplication by the inverse 
iat Gigs 


Remark 2. Consider the polynomial ring RLX] in a variable X over a ring R 
and let f,g € RLX]. Then 


deg(f + g) < max(deg f, deg q), 
deg(f - g) < deg f + deg g, 


and even deg(f +g) = deg f +degg if R is an integral domain. 


Proof. The assertions are clear if f or g is zero. Therefore, assume m = deg f > 0, 
as well as n = deg g > 0, say f = S>a;X', g = 95 0;X*. Then we conclude that 
a; + b; = 0 for i > max(m,n) and hence deg(f + g) < max(m,n). Similarly, we 
see that )7,,4,<; %ub, = 0 for i > m+n and therefore get deg(f-g) <m-+n. 
Finally, if R is an integral domain, then deg f = m and degg = n imply 
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that the coefficients am, b, are nonzero and hence that >> fro re Ayby = Ambn, 
which is the coefficient of degree m+n in f -g, is nonzero. This shows that 
deg(f-g) =m-+n. 


There are several properties of rings that a polynomial ring R[_X] inherits 
from its ring of coefficients R. As a simple example, we look at integral domains. 


Remark 3. Let R be an integral domain. Then the polynomial ring R[X] is 
an integral domain as well. Furthermore, (RLX])* = R*. 


Proof. Use the relation deg(f - g) = deg f + deg g from Remark 2. 


Finally, we want to establish Euclidean division for polynomial rings, a 
process that is particularly known from the ring of integers Z. Euclidean division 
is used in 2.4 in order to show that polynomial rings over fields are unique prime 
factorization domains. 


Proposition 4. Let R be a ring and g = Yx¢,a;X' € R[X] a polynomial 
whose leading coefficient ag is a unit in R. Then, for each f € RLX], there 
exist unique polynomials q,r € R[X] such that 


f=ag+r7, degr < d. 


Proof. First observe that we have deg(qg) = degq + degg for arbitrary poly- 
nomials g € R[LX], even if R is not an integral domain. Indeed, the leading 
coefficient aq of g is a unit. So if q is of some degree n > 0 with leading coef- 
ficient c,, then c,aq # 0. However, this is the leading coefficient of gg, and we 
conclude that deg(qg) = n+ d. 

Next, to justify the uniqueness assertion, consider a polynomial f € RLX | 
admitting two decompositions of the desired type, say f = qgtr=qg+r". 
Then we get 0 = (q¢— q')g + (r — 1") and, by the above argument, 


deg(q — q') + deg g = deg(r — 1’). 


Since r and r’ are of degree < d, the same is true for r — r’, and we sce that 
deg(q—q’)+deg g < d. However, this can be true only for g = q’, since deg g = d. 
But then we must have r =r’ as well, and the uniqueness assertion is clear. 

To derive the existence part of Euclidean division we proceed by induction 
on n = deg f. If deg f < d, set gq = 0 and r = f. On the other hand, if we have 
f = eo GX" as well as c, #0 and n > d, then 


f= f—cnay'X"%g 


is a polynomial of degree < n. By the induction hypothesis, this admits a 
decomposition f; = qig +11 with polynomials q,r; € R[X], where degr, < d. 
Hence, 
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f=(a+eqa)'X" gtr 


is a decomposition of f, as desired. 


The argument presented in the above proof can be used to explicitly carry 
out Euclidean division in polynomial rings R[X], similarly to how this is done 
for the ring of integers Z. To give an example, consider the polynomials 


fox? 43X34 XP = 6X? —X +1, ga X?4OxX*4 X=4 
from ZX]: 


(X° 4+3X4 +X -6X? —X 41) : (X942X724X-1) = X74 X-2 
X5 42X4 +X3 —xX? 


Pie —5X? —-X 

X* +2X° 4X? —X 
—2X° —6xX? +1 
—2X3 —4X? -2X 42 
—2X? 4+2X -1 


In a first step we subtract Xg from f, then in a second Xg from f — X2g, and 
in a third —2g from f — X2g — Xg. We obtain —2X? + 2X — 1 as remainder, 
thus leading to the decomposition 


f =(X? 4X —2)g+ (-2X7+2X — 1). 


Finally, let us mention that the construction of the polynomial ring RLX | 
can be generalized. For example, in 2.5 we will introduce polynomial rings in 
several variables. On the other hand, we can replace the set R® by RN, the set 
of all maps from N to R. Then, proceeding in the same way as in the case of 
polynomial rings, the ring R[_X]] of formal power series in one variable X over 
R is obtained. Its elements can be written as infinite series of type )>7°, aX". 


Exercises 


1. Verify the relations 0-a =0 and (—a)-b = —(a-b) for elements a,b of a ring R. 


2. The polynomial ring R[X] has been defined over commutative rings R. To what 
extent is it possible and makes sense to consider polynomial rings within the 
context of not necessarily commutative rings? 


3. Explicitly work out Euclidean division in Z[X], as specified in Proposition 4, for 
the following polynomials: 
(i) f =3X54+2K4—X343xX2-4K47, g=X*-2X +1. 
(ii) f= X54 X4-5X34+2X249K-1, g=X?-1. 
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4. Let K be a field and g € K[X] a polynomial in one variable of degree d > 0. 
Prove the existence of the so-called g-adic expansion: Given f € KX], there are 
unique polynomials ag, a1,... € KX] of degree < d, where a; = 0 for almost all 
i, such that f = 0; aig’. 


5. Let R be a ring containing a nilpotent element a € 0; nilpotent means that there 
is some n € N such that a” = 0. Show that the group of units R* is a proper 
subgroup of the group of units (RLX])*. 

6. Determine the smallest subring of R containing Q and V2, and show that it is in 
fact a field. 


7. Let R be a ring. Show that a formal power series )>a;X' € RIX] is a unit if 
and only if ao is a unit in R. 


8. Prove that the Hamiltonian quaternions H from Example (2) form a skew field. 


2.2 Ideals 


Ideals of rings are basic in ring theory, just as normal subgroups are in group 
theory. However, an ideal is not necessarily a subring of its ambient ring, due 
to the fact that in nontrivial cases, it does not contain the unit element of 
multiplication. 


Definition 1. Let R be a ring. A subset a C R is called an ideal in R if: 
(i) a is an additive subgroup of R. 
(ii) re R,aea ra€a. 


Every ring R contains the so-called trivial ideals, namely the zero ideal {0}, 
denoted by 0 as well, and the unit zdeal R. These are the only ideals if R is a 
field. For given ideals a,6 of a ring R, the following ideals can be constructed: 


a+b:={a+b;aea,beEb}, 


<0o 
a> he {So aibi; a; Ea, b € b}. 


i=1 


anb:={z;2€aand ze 6}. 


Note that always a- 6 C amb. Furthermore, one can define the product of 
finitely many ideals similarly as before, as well as the sum and the intersection 
of arbitrarily many ideals. The sum 5* a; of a family of ideals (a;);er consists 
of all sums }>a;, where a; € a; and a; = 0 for almost alli € J. Fora € R, 
we call Ra := {ra; r € R} the principal ideal generated by a. More generally, 
for elements a,,...,@, € R, one can consider the ideal that is generated by 
@1,---,@n, namely 


(a1,.--,@n) = Ray +...+ Ray = {riay +... + 7nGn 3 71,---,Tn € R}. 
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It is the smallest ideal in R containing aj,...,@n, i-e., every ideal of R that 
contains the elements a,,...,@, will also contain the ideal (a;,...,a,). Analo- 
gously, we can consider the ideal generated in R by a family of elements (a;)iez 
of R, namely the ideal >), Raj. 


Definition 2. Let a be an ideal of a ring R. A family (a;)ier of elements in a 
is called a system of generators of a if a= >0,., Raj, i.e., if a coincides with 
the ideal generated by the family (a;)ier. We say that a is finitely generated if 
a admits a finite system of generators. Furthermore, a is called a principal ideal 
if a is generated by a single element, i.e., if there exists a € a such that a = (a). 
If R is an integral domain and every ideal of R is principal, then R is called a 
principal ideal domain. 


The trivial ideals of a ring are always principal, since they are generated by 
the zero element 0, resp. the unit element 1. Moreover, the subgroups of type 
mZ C Z constitute principal ideals in the ring of integers Z. These are the only 
subgroups of Z by 1.3/4. Hence, there cannot exist any further ideals in Z. Since 
Z is an integral domain, we have the following: 


Proposition 3. Z is a principal ideal domain. 


Generators of principal ideals are not unique, since they can always be 
altered by multiplication by a unit. In integral domains this is the only way to 
change generators of principal ideals: 


Remark 4. Let R be an integral domain. Then two principal ideals a = (a) and 
b = (b) of R coincide if and only if there is a unit c€ R* such that b = ca. 


Proof. If a = 6, there is no loss of generality in assuming a= 6b #0. Then bea 
and there exists an element c € R such that b = ca. Likewise, we have a € 6 
and therefore an element c’ € R such that a = c'b. Then b = ca = cc’b, and thus 


(1—cc’')b = 0. 


Since R is an integral domain and 6 is nonzero due to b # 0, we get cc’ = 1, 
and c is a unit. The reverse implication is trivial. 


Two elements a, b of a ring R are said to be associated to each other if there 
is a unit c € R* such that b = ca. Therefore we can state that two elements 
of an integral domain generate the same principal ideal if and only if they are 
associated. Note that this assertion does not extend to more general rings; cf. 
Exercise 7 in Section 2.3. 

Finally, let us consider the polynomial ring Z[X] as an example. The prin- 
cipal ideal generated by X is given by 


= DS a,X? € ZX]; a9 = 0}. 
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and the one generated by 2 is 


(2) = {> a;X' € Z[X]; a; is even for all i}, 


Since there does not exist any nonunit in Z[X] that admits 2 as well as X as 
multiples, we immediately see that 


(2,X) = {Sax € Z[X]; ao is even} 


is an ideal in ZX] that cannot be principal. In particular, Z[X] fails to be a 
principal ideal domain, although it is an integral domain by 2.1/3. 


Exercises 


1. 


Let a = (a1,...,@m) and b = (b1,...,bn) be ideals in a ring R. Specify systems 
of generators for the ideals a+ 6 and a-b. Furthermore, discuss the case of the 
ideal a b. 


. Examine in which cases the union of two ideals or, more generally, the union of 


a family of ideals of a ring R is itself an ideal. 


Let K be a field. Consider K* = K x K as the ring-theoretic product of K with 
itself, but also as a K-vector space. Compare the notions of subrings, ideals, and 
subvector spaces in this example. 


Specify single generators of the following ideals in Z: 


Let R be a ring, X a set, and Y C X a subset. Examine which of the following 
subsets of R*, the ring of all maps X —> R, gives rise to a subring or an ideal: 


M, = ea € R* ; f is constant on ¥}, 

M, = {f € R*; f(Y) =0}, 

M3 ={f € R*; f(y) £0 forallyeY}, 

M, = ei € R* ; f(y) =0 for almost all y € Vie 


Which conditions on Y ensure that we get principal ideals in the ideal cases? 


Let R be a ring. Show that 
{a € R; there is an integer n € N such that a” = 0} 


gives rise to an ideal in R (the so-called radical or nilradical of R). 


Let K be a field. Determine all ideals of the ring of formal power series Kk [LX]. 
(Use Exercise 7 from Section 2.1.) 
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The notion of a group homomorphism extends naturally to the setting of rings. 


Definition 1. Let R and R’ be rings. A map py: R —> R’ is called a ring 
homomorphism ?f: 

(i) y(a+b) = y(a)+ ¢(b) for all a,b € R, i.e., y is a group homomorphism 
with respect to addition. 

(ii) pla b) = v(a)- v(d) for all a,b € R and y(1) = 1, we., p is a monoid 
homomorphism with respect to multiplication. 


It is verified without difficulty that the composition of two ring homomor- 
phisms yields a ring homomorphism. A ring homomorphism vy: R —> R’ is 
called an isomorphism if yp admits an inverse, i.e., if there is a ring homomor- 
phism 7: R’ —> R satisfying wo y = idg and yow = idp. This is equivalent 
to the fact that y is bijective. Injective (resp. surjective) ring homomorphisms 
R — R’' are also referred to as monomorphisms (resp. epimorphisms). An en- 
domorphism of R is a homomorphism R — R, and an automorphism of R is 
an isomorphism R —> R. 


Remark 2. Let y: R — R' be a ring homomorphism. Then: 
(i) kery = {a € R; y(a) = 0} ts an ideal in R. 
(ii) imp = y(R) is a subring of R’. 
(iii) y induces a group homomorphism R* —+ R™ between the multiplicative 
groups of units in R and R’. 


The assertions can be verified straightaway. Note that the image of a ring 
homomorphism y: R —> R’ will generally fail to be an ideal in R’. If R and 
R’ are fields, ring homomorphisms R —+ R’ will also be referred to as field 
homomorphisms. 


Remark 3. Let K be a field and Ra ring, R #0. Then every homomorphism 
yp: K — Ris injective. In particular, every homomorphism of fields is injective. 


Proof. We know that ker y is an ideal in K. It is even a proper ideal, since 
y(1) = 1 4 0. But then ker y = 0, since a field does not contain any proper 
ideals, except for the zero ideal. 


For an arbitrary ring R, there is precisely one ring homomorphism Z —> R, 
namely the map given by n -> n-1. Here n-1 for n > 0 is to be interpreted 
as the n-fold sum of the unit element 1 € R, and likewise for n < 0 as the 
(—n)-fold sum of —1. The inclusion map R @ R’ attached to a ring extension 
RC R’ is a (trivial) example of a ring homomorphism. Furthermore, given 
such an extension, every element x € R’ gives rise to a so-called substitution 
homomorphism 
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RIX] RY, f= Sl aiX'> fe) =) aie’, 


which is a ring homomorphism. Indeed, the process of evaluating polynomials 
f,g © RLX] at elements x € R’ was already dealt with in 2.1, including the 
compatibilities (f + g)(x) = f(x) + g(x) and (f+ g)(x) = f(x)- g(x) that are 
required for a ring homomorphism. 

In the following, let R be a ring and a an ideal in R. We want to adapt the 
construction of the factor group G/N of a group G by a normal subgroup N to 
the setting of rings and construct a so-called factor ring or residue class ring 
R/a, together with a surjective ring homomorphism 7: R —> R/a satisfying 
ker 7 = a. To begin with we define R/a as an abelian group, just by viewing 
a as a (normal) subgroup of the additive group of R. Then R/a consists of all 
residue classes of type « + a for x € R, where addition in R/a is given by the 
formula 

(c+a)+(y+a)=(x+y)+a. 


That this law of composition is well defined and makes R/a an abelian group 
was shown in 1.2. Proceeding similarly with the multiplication, we define for 
residue classes x + a, y+ ain R/a their product by 


(cw +a)-(yt+a):=(a-y) +a. 


Of course, we have to check that the product is well defined or, more precisely, 
that the residue class (x-y)-++a is independent of the choice of the representatives 
x,y of the residue classes x + a and y+ a. To do this, assume z’+a=ax+a 
and thus x’ = «+ a for some a € a, as well as y’+a=y+aand y’=y+6 for 
some b € a. Then a’y’ = xy + ay’ + xb € (xy) +4, and we see that 


(xy) +a = (2’y’) +4. 


Therefore, the multiplication is well defined on R/a, and it is immediately clear 
that the ring properties of R carry over to R/a. Moreover, the canonical pro- 
jection 

a: R—+ R/a, ur ata, 


is aring homomorphism satisfying ker 7 = a, which, similarly as in 1.2/6, admits 
a universal property: 


Proposition 4 (Fundamental theorem on homomorphisms). Let y: R — R! 
be a ring homomorphism and ac R an ideal satisfying a C ker py. Then there 
exists a unique ring homomorphism G: R/a —> R' such that the diagram 


is commutative. Furthermore, 


2.3 Ring Homomorphisms, Factor Rings 37 


imG=imy, ker @ = z(ker vy), ker p = 1 ‘(ker ). 
In particular, DB is injective if and only if a= ker y. 


Corollary 5. /f »: R —> R' is a surjective ring homomorphism, then R' is 
canonically isomorphic to R/ ker y. 


To do the proof of Proposition 4 we apply 1.2/6 to the additive group of R. 
Then it remains only to check that the group homomorphism @: R/a —> R’ 
obtained from 1.2/6 is in fact a ring homomorphism. However, since @ is char- 
acterized by the equation 


this assertion is clear. 


Also note that the isomorphism theorems 1.2/8 and 1.2/9, which were ob- 
tained as corollaries to the fundamental theorem on homomorphisms 1.2/6, 
carry over without difficulty from the setting of groups to the present setting of 
rings, either in a direct way or as a consequence of Proposition 4; just replace 
normal subgroups by ideals. 

The rings Z/mZ viewed in 1.3 as abelian groups are natural examples of 
residue class rings. In particular, form > 0 we see that Z/mZ is a ring consisting 
of m elements. 


Proposition 6. The following conditions are equivalent for m € Z,m > 0: 
(i) m is a prime number. 
(ii) Z/mZ is an integral domain. 


(iii) Z/mZ is a field. 


Proof. For any x € Z, let us denote by % € Z/mZ the attached residue class 
modulo mZ. To begin with, assume condition (i), i-e., that m is a prime number. 
Then m > 1 and Z/mZ is nonzero. Now if @- b = 0 for two integers a,b € Z, 
then ab € mZ, and we see, for example by looking at the prime factorizations 
of a, b, and ab, that m divides a or b. Thereby we get a € mZ or b € mZ, i-e., 
a= 0 or b=0, and Z/mZ is an integral domain, as required in (ii). 

Next we can conclude for every @ € Z/mZ—{0} from (ii) that the map 


Z/mZ —> Z/mZ, TH>G-T, 


is injective and, in fact, bijective, since Z/mZ is finite. In particular, the unit 
element 1 of Z/mZ belongs to the image of this map, and it follows that @ admits 
a multiplicative inverse in Z/mZ. But then Z/mZ is field, and we get (iii). 
Finally, assume that Z/mZ is a field as in (iii) or, more generally, an integral 
domain. This implies Z/mZ # 0 and therefore m > 1. To show that m is prime, 
consider a divisor d € N of m and an equation m = da. Then d-@ = 0, and 
we get d = 0 or @ = 0, since Z/mZ is an integral domain. In the first case, m 
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divides d, and hence d = m. In the second, m divides a, so that a = m and 
d= 1. Thus, m admits only itself and 1 as divisors and therefore is prime. 


In particular, we have seen that the ring Z/pZ, for a prime p, is a field 
consisting of p elements; it is denoted by F,. More generally, using elemen- 
tary number theory, one can show for integers m > 1 that the group of units 
(Z/mZ)* consists of all residue classes @ such that a € Z is relatively prime 
to m. Next we want to view the assertion of Proposition 6 in a more general 
context. 


Definition 7. Let R be a ring. 

(i) A proper ideal p ¢ R is called a prime ideal if ab € p for elements 
a,be Rimplesacep or bep. 

(ii) A proper ideal m © R is called a maximal ideal if m Cac R for an 
ideal a C R implies a=m or a= R. 


For example, the zero ideal of a ring R is prime if and only if R is an integral 
domain. 


Proposition 8. Let R be a ring. 
(i) An ideal p C R is prime if and only if R/p is an integral domain. 
(ii) An ideal m C R is maximal if and only if R/m is a field. 
In particular, every maximal ideal is prime. 


Proof. First of all, note that p is a proper ideal in R if and only if the residue 
class ring R/p is nonzero, similarly for m. Now assertion (i) is easy to verify. 
Look at residue classes @, b € R/p of elements a,b € R. Then 


a-bep = acéporbep 


is clearly equivalent to 


a@-b=0 aZ=Oorb=0. 


Furthermore, assertion (ii) is a consequence of the following two lemmas: 


Lemma 9. An ideal m C R is maximal if and only if the zero ideal 0 C R/m 
is maximal. 


Lemma 10. The zero ideal 0 C R of a ring R is maximal if and only if R is 
a field. 


Proof of Lemma 9. Let 7: R —> R/m be the canonical projection. It is easily 
checked that the mappings 

RD a + +n(a)C R/m, 

RD 1(6)+4 6 CR/m, 
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define a bijection between all ideals a C R such that m Cac R and the ideals 
6 C R/m. The stated equivalence is an immediate consequence of this fact. 
Alternatively, the assertion can be verified in a more direct way. Indeed, 
recall that m is a proper ideal in R if and only if the residue class ring R/m 
is nonzero. Now, a proper ideal m C R is maximal if and only if we have 
m+ Ra = R for each a € R—m, ie., if and only if for each such a there 
exist elements r € R and m € m such that ra +m = 1. Using the projection 
a: R —> R/m, this condition is true if and only if each @ € R/m—{0} admits 
a multiplicative inverse 7 € R/m satisfying 7 -a@ = 1, i-e., if and only if, finally, 
the zero ideal is maximal in R/m. 


Proof of Lemma 10. Assume that the zero ideal 0 C R is maximal and consider 
an element a € R—{0}. Then aR = R, and there exists an element b € R such 
that ab = 1. Thus R* = R— {0}, and R is a field. Conversely, that the zero 
ideal of a field is maximal is immediately clear. 


Propositions 6 and 8 give a complete overview of prime and maximal ideals 
in Z: 


Corollary 11. An ideal in Z is prime if and only if it is of type pZ for a prime 
number p or for p=0. An ideal in Z is maximal if and only if it is a nonzero 
prime ideal. 


Just use the fact that Z is a principal ideal domain by 2.2/3 and that the 
zero ideal of an integral domain is prime. To end the present section we want 
to establish the so-called Chinese remainder theorem. 


Proposition 12. Let R be a ring and ay,...,a4, C R pairwise coprime ideals, 
i.e., assume that a; +a; = R for i #4 j. Then, writing 1;: R —> R/a; for the 
canonical projections, the homomorphism 


yp: R— R/a, x... xX R/O, rH (m1(z),...,%n(2)), 


is surjective and satisfies kerp = a, M...M dn. In particular, it induces an 
isomorphism 


R/C) oi TP R/a. 
i=1 i=l 


where [[i_, R/a; = R/a, x... R/ay is the ring-theoretic product of the residue 
class rings R/aj. 


Proof. To begin with, let us show for j = 1,...,n that the ideals a; and Niey a; 
are coprime in the sense that their sum yields R. To do this, fix an index j. 
Since a; and a; are coprime by assumption for each i # j, there are elements 
a; € a; and aj € a; such that a; + a} = 1. Hence, we get 
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1=|[(i+a))€a;+]Jaca+fai 


ifj ifj ifj 
and therefore a; + (),z; 4: = R, as claimed. 


In particular, there exist equations d; +e; = 1 for 7 = 1,...,n and elements 
dj € aj, ey €f),z, ai, and we see that 


ji for i=yj, 
mle) ={ 4 for iF j. 


This shows that y is surjective. Indeed, look at an element y = (y1,.--,Yn) in 
R/a, x ... x R/a, and choose a 7;-preimage x; € R of y; for each 7. Then 


(> nei) = Y. 
i=1 


Finally, the assertion about the kernel of y is trivial, and the stated isomorphism 
is readily derived from the fundamental theorem on homomorphisms. 


If a is an ideal of a ring R, two elements x,y € R are said to be congruent 
modulo a, written x = y mod a, if x and y give rise to the same residue class 
in R/a, ie., if 2 — y € a. In the case that a is a principal ideal Ra, one also 
writes “mod a” instead of “mod a.” Using such terminology, the surjectivity of 
the map y in Proposition 12 can be expressed as follows: given x1,...,%n € R, 
there exists an element « € R such that + = x; moda; fori = l,...,n. 
Consequently, for the ring of integers Z, the Chinese remainder theorem takes 
the following shape: 


Corollary 13. Let a1,...,a, € Z be integers that are pairwise relatively prime. 
Then the system of simultaneous congruences « = x; moda,;,i = l,...,n, 
is solvable for arbitrary integers 2%1,...,%n € Z, and the solution x is unique 
modulo a,-+...+ Gy. Therefore, the set of all solutions forms a residue class of 
type © + a,-...+ GyZ. 


Of course, it has to be checked that relatively prime integers a, a’ € Z satisfy 
the equations 


(a,a’)=(1) and (a-d’) =(a)N (a); 


for details see 2.4/13. Also note that the proof of the Chinese remainder theorem 
provides a constructive method for solving systems of simultaneous congruences. 
Indeed, one determines integers dj € (aj), e; € ([],4; ai) for 7 = 1,...,n that 
satisfy dj; + e; = 1, for example using EFuclid’s algorithm; see 2.4/15. Then 
xz = 7, vie; is a solution of the system x = x; mod a,;, i = 1,...,n, and 
all other solutions are obtained from this special one by adding a multiple of 


Tia Qj. 
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Exercises 


1 


Consider a ring homomorphism yp: R — R' and look for valid assertions about 
the images of ideals a C R, as well as on the preimages of ideals a’ C R'. Examine 
the same question also for prime and maximal ideals. 


For an element x of a ring R, consider the substitution homomorphism 
pr: R[X] — R, Sa X* > So aja, 


and describe the kernel of pz. In particular, discuss the cases in which it is a 
prime ideal, resp. a maximal ideal in R[X]. 


Generalize the isomorphism theorems 1.2/8 and 1.2/9 to the setting of rings, by 
considering rings instead of groups and ideals instead of normal subgroups. 


Let y: R —> R' be a ring homomorphism. Show for an element x € R’ that 
there is precisely one ring homomorphism ©: R[X] —> R’ satisfying ®|p = y 
and ®@(X) = z. In particular, the set of ring homomorphisms ®: R[X] —> R’ 
such that &| = ¢ is in one-to-one correspondence with the set of elements of R’. 


Let R be an integral domain and @: R[X] —> R[X] a ring homomorphism 
satisfying ®|p = idg. Show that @ is an automorphism if and only if there are 
elements a € R* and b € R such that &(X) = aX +b. 


Let p be a prime ideal of a ring R. Show that pR[X], the ideal generated by p 
in RX], is a prime ideal. 


Let & be a field and KLX,Y] = K[X][Y] the polynomial ring in two variables 
X and Y over K. Consider the residue class ring R = K[X,Y]/(XY?) and 
denote by X, resp. Y, the corresponding residue classes of X, resp. Y. Show that 
the elements X and X + X -Y generate the same principal ideal in R, although 
they are not associated. Hint: Look at the ideal consisting of all elements f € R 
such that f -X = 0, resp. at the ideal of all elements f €¢ K [X,Y] such that 
fX € (XY). 


Let R be a ring. Show that {> a;X* € RLX]; a, = 0} is a subring of RLX] and 
that it is isomorphic to RLX][Y]/(X? — Y°). 


2.4 Prime Factorization 


Basic properties of the ring of integers Z, and the polynomial ring K LX] over a 
field K, are related to the fact that these rings admit a division with remainder, 
which is called Euclidean division. We want to look at general integral domains 
admitting such a division process and show that they belong to the class of 
principal ideal domains. For principal ideal domains, in turn, we will prove the 
existence and uniqueness of prime factorizations. 


Definition 1. An integral domain R is called a Euclidean domain if it admits a 
map 6: R—{0} —+ N making possible Euclidean division in R in the following 
sense: 
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Given elements f,g € R, g #0, there are elements q,r € R such that 
f=aqg+r, where d(r) <d(g) or r=0. 
The map 6 is referred to as a Euclidean function of the Euclidean domain R. 


Every field is a Euclidean domain for trivial reasons, but there are some 
more interesting examples. 


(1) Z is a Euclidean domain under the usual division with remainder. The 
map 0: Z—{0} —>+ N, at |al, serves as a Euclidean function. 


(2) The polynomial ring K[X] over a field K is a Euclidean domain under 
the usual polynomial division introduced in 2.1/4; the map 6: K[X]-{0} — N, 
f > deg f, serves as a Euclidean function. 


(3) The ring of Gaussian integers Z[t] := {x + iy; x,y € Z} C Cisa 
Euclidean domain, with Euclidean function 


6: Z[i]—{0} —N, gtiy oa? +y’ = |x + iy|. 


In order to characterize the division with remainder in Z[i], observe that the 
distance between adjacent points in Z[i] is at most 2. Thus, given f,g € Z[i], 
g # 0, there exist integers x,y € Z such that |fg~! — (x + iy)| < - -f2 <1. 
Now setting g := (a + iy) and r := f — qg, we get |r| < |g| and therefore 


f=ag+r, where d(r) < d(g) orr=0. 


(4) Let d 4 0,1 be a square-free integer, which means that d € Z does not 
admit a square of an integer > 1 as a divisor, and consider the following subring 
of C: 

_ JZ+v4d-Z, if d=2,3 mod 4, 
“~ \Z+i(14Vd)-Z, ifd=1 mod 4. 


For d = —1 we obtain the ring of Gaussian integers, as discussed before. The 
rings Ry are of special interest in number theory. One would like to know 
whether Ry, depending on d, is a unique factorization domain, i.e., whether 
the elements of Rg admit unique prime factorization. Since a Euclidean domain 
is a principal ideal domain and a principal ideal domain is a unique factoriza- 
tion domain, see Proposition 2 and Corollary 11 below, one starts by looking 
at values d such that Ry, is Euclidean. As Euclidean function 6: Ry—{0} —> N 
one may try the so-called norm map given by 6(a + bVd) = |a? — bd]; see 
Section 4.7 for details on the norm. It can be shown that the norm map is a 
Euclidean function on Rg precisely for the following values of d: 


fea ee ee ey 
d = 2,3,5,6, 7, 11, 13, 17, 19, 21, 29, 33, 37, 41, 57, 73. 
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In particular, Rg is Euclidean and thus a unique factorization domain in these 
cases. Moreover, it is known for d < 0 that Ry is a unique factorization domain 
in precisely the following additional cases: 


d = —19, —43, -67, —163. 


On the other hand, there are several values d > 0 for which Rg is known to be 
a unique factorization domain, but not necessarily a Euclidean domain; see, for 
example, H. Hasse [7], $16.6. 


Proposition 2. Every Euclidean domain is a principal ideal domain. 


Proof. Proceeding as in 1.3/4, let a C R be an ideal, where we may assume 
a # 0. Choose an element a of a—{0} such that 6(a) is minimal with respect 
to the Euclidean function 6 considered on R. We claim that a = (a). Indeed, 
let f € a and decompose it in terms of Euclidean division, say f = qa +7, 
where d(r) < d(a) or r = 0. Then we get r = f — ga € a. However, due to the 
minimality of d(a), we must have r = 0 and hence f = ga € (a). This shows 
that a C (a). Since the reverse inclusion is trivial, we get a = (a), and a is 
principal. 


Corollary 3. The rings Z, Zi], as well as the polynomial ring K |X] over a 
field K, are Euclidean domains and hence principal ideal domains. 


Next we want to study prime factorizations in principal ideal domains. For 
elements x and y of an integral domain R we say that x divides y, writing x|y, 
if there is an element c € R such that ca = y, or equivalently, if y € (x). If such 
is not the case, i.e., if « does not divide y, we write xfy. 


Definition 4. Let R be an integral domain and p € R a nonzero nonunit. 

(i) p is called irreducible if a factorization of type p = xy with x,y ER 
implies x € R* or y € R*. Furthermore, p is called reducible if it is not 
irreducible. 

(ii) p is called a prime element if from p|xy with x,y € R we get p|x or 
ply, te., in other words, if the principal ideal (p) is prime. 


The irreducible elements of the ring of integers Z are given, up to sign, 
precisely by the usual prime numbers, while we can see for the polynomial ring 
K(X] over a field K that particularly the linear polynomials of type X — a 
for a € K are irreducible. Note that for AK = C there are no further irre- 
ducible polynomials, up to associatedness, i.e., up to multiplication by nonzero 
constants from K; this will be a consequence of the fundamental theorem of 
algebra, to be proved in Section 6.3. However, over a general field K, there can 
exist irreducible polynomials of higher degree, such as the polynomial X?+1 in 
R[X]. Furthermore, we will see in Proposition 6 that the irreducible and prime 
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elements coincide in principal ideal domains, for example in the ring of integers 
Z and the polynomial ring KX] over a field K. 


Remark 5. Let R be an integral domain and p € R a nonzero nonunit. 
(i) If (p) is a maximal ideal in R, then p is a prime element. 
(ii) If p is a prime element, then p is irreducible. 


Proof. If (p) is a maximal ideal in R, then it is a prime ideal as well, see 2.3/8, 
and it follows that p is a prime element. This verifies assertion (i). To establish 
(ii), assume that p is a prime element. Then, if p = xy for some x,y € R, we 
get p|x or ply, since p is prime. Assuming p|.x, there exists an element c € R 
such that pe = x and hence p = xy = pcey. Since RF is an integral domain, we 
must have cy = 1 and therefore y € R*. Thus, p is irreducible. 


For principal ideal domains, the assertions we have just proved can be sharp- 
ened substantially; see also 2.3/6. 


Proposition 6. Let R be a principal ideal domain and p € R a nonzero nonunit. 
The following conditions are equivalent: 
(i) p is irreducible. 
(ii) p is a prime element. 
(iii) (p) is a maximal ideal in R. 


Proof. In view of Remark 5, it remains to show that (i) implies (iii). Therefore, 
assume that p is irreducible and let a be an ideal in R satisfying (p) Cac R, 
say a = (a), since R is a principal ideal domain. Then there is an element c € R 
such that p = ac, and p being irreducible implies a € R* or c € R*. Hence, we 
get a = R in the first case and a = (p) in the second. This shows that (p) is a 
maximal ideal in R. 


Using the preceding result, it is quite easy to derive the existence of prime 
factorizations in principal ideal domains. In fact, it is enough to look at factor- 
izations into irreducible elements. 


Proposition 7. Let R be a principal ideal domain. Then every nonzero nonunit 
a € R is a product of prime elements.” 


Proof. Fix an element a € R—(R* U {O}). If a is irreducible (and thereby 
prime), nothing has to be shown. Otherwise, decompose a into the product bc 
of two nonunits in R. If one of the factors b and c is not yet irreducible, we can 
further decompose b or c and so forth. To obtain the desired assertion of the 
proposition it remains only to show that the procedure stops after finitely many 
steps and thus yields a factorization of a into irreducible and thereby prime 
factors. Concerning the rings that are of special interest to us, such as Z and 


? A product of elements of a ring is always meant to be a finite product. 
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K(X] for a field K, this is immediately clear. Indeed, in Z we have |b], |c| < |a| 
for a factorization of a into nonunits b,c. Similarly, we get deg b, deg c < dega in 
K(X], as follows from 2.1/2. Hence, recursively decomposing a into a product of 
nonunits, the absolute value, resp. the degree of factors, strictly decreases every 
time we further decompose a factor into a product of nonunits. Therefore, it 
becomes clear that the procedure must stop after finitely many steps. 


Cc 


However, in order to establish Proposition 7 in its full generality, we add 
here a general argument, valid for principal ideal domains R, that implies the 
existence of factorizations of a into (finite) products of irreducible elements. 
The following auxiliary assertion is needed: 


Lemma 8. Every principal ideal domain R is Noetherian, 1.e., every ascending 
chain of ideals a, C dg C... C R becomes stationary in the sense that there is 
some n EN such that a; =a, for alli > n. 


The assertion is easy to verify. Since the union of an ascending chain of 
ideals is itself an ideal, we can consider a = U),., a; as an ideal in R; it is a 
principal ideal, say a = (a). However, since a € a, there is some n € N such 
that a € a,, and we get (a) C a, C a = (a). As a result, the chain of ideals 
a; C do C... becomes stationary at ay. 

Now we can prove Proposition 7 for general principal ideal domains R. Let 
S be the set of all principal ideals in R admitting a generator a € R-(R* U {0}) 
such that a does not allow a finite factorization into irreducible elements. We 
have to show that S = Q. Assuming S 4 0, we conclude from Lemma 8 the 
existence of a maximal element in S, i.e., of an element a € S' such that every 
strict inclusion a ¢ 6 of ideals in R implies that 6 cannot belong to S. Now let 
a = (a) be such a maximal element of S. Then the generating element a must 
be reducible, say @ = a a2 for nonunits a,,a2 € R. As a consequence, we get 
strict inclusions 


(a)S(m), (a) S (a), 


and we see that (a,) and (a2) cannot belong to S. In particular, a; and az admit 
factorizations into irreducible elements, and the same is true for the product 
@ = 41d, in contradiction to (a) € S. Therefore, S = 0, and the assertion of 
Proposition 7 is clear. 


Next we want to show that prime factorizations as considered in Proposi- 
tion 7 are essentially unique. 


Lemma 9. Let R be an integral domain. For an element a € R, consider 
factorizations 


G=Pi.--Pr=N.---s 


into prime elements p; and irreducible elements q;. Then r = s, and one can 
renumber the q; in such a way that p; is associated to q; for i=1,...,r. 
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Proof. Since p;|q...ds and p; is prime, there exists an index j such that pj | q;. 
Renumbering the qg;, we may assume 7 = 1. Hence, using the fact that q is 
irreducible, there is an equation q, = €,p; for a unit ¢,, and we can conclude 
that 


P2.--Pr = €142---s- 


Continuing inductively, the desired assertion follows. 


Proposition and Definition 10. For an integral domain R, the following 
conditions are equivalent: 

(i) Every element a € R—(R* U {0}) can be uniquely written, up to associ- 
atedness and order, as a product of irreducible elements. 

(ii) Every element a € R—(R* U {0}) is a product of prime elements. 

An integral domain R satisfying the equivalent conditions (i) and (ii) is 
called a factorial ring or a unique factorization domain. Alternatively, we say 
that the elements of R admit unique prime factorization. 

An element of a unique factorization domain is irreducible if and only if it 
is prime. 


Proof. First, assuming condition (i), we want to show that every irreducible 
element of R is prime. To do this, let a € R be irreducible and fix x,y © R such 
that a| ay. We have to show that a|x or a|y, where we may assume that x 
and y are nonunits. Now let «= 2,...2%, andy = y,...ys be factorizations into 
irreducible elements as provided by (i). Then we get a|(a1...%,-Y1.--Ys), and 
the uniqueness assertion of (i) implies that a, being irreducible, is associated to 
one of the elements 2;, y;. In particular, we get a|x or a|y. Hence, a is prime. 

As a by-product, this argument settles the implication from (i) to (ii), while 
the reverse follows from Lemma 9. Indeed, due to Remark 5, any factorization 
into prime elements is a factorization into irreducible elements. 

Finally, if R is a unique factorization domain, it satisfies condition (i), and 
thus all irreducible elements are prime. The converse of this follows from Re- 
mark 5 again. 


Now the assertion of Proposition 7 can be phrased in a new way: 
Corollary 11. Every principal ideal domain is a unique factorization domain. 


Fields are unique factorization domains for trivial reasons. But also the 
rings Z, Z[i], as well as the polynomial ring K [|X] over a field K, are unique 
factorization domains, since they are Euclidean and thus principal ideal do- 
mains. We will show in 2.7/1 that the polynomial ring R[X] over a unique 
factorization domain R is itself a unique factorization domain. For example, we 
thereby see that the polynomial ring Z[X] is a unique factorization domain, 
although it is not a principal ideal domain. The same holds for the polynomial 
ring K [X,Y] := K[LX][Y] in two variables X and Y over a field K. 
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For unique factorization domains R, it is quite common to write a product 
of associated prime elements as a power of one of them, of course adjusted by 
a unit. In this way, one considers prime factorizations of type 


= Vy VY 
a= eEp 1 Pr’ s 
for a unit €, certain exponents ,...,/-, and pairwise nonassociated prime 


elements p,,...,p-. Then, formally speaking, every element a € R—{0} admits 
such a prime factorization (with exponents v; = 0 if a is a unit). To further 
standardize prime factorizations, one can fix a system P of representatives of 
the prime elements in R, i.e., a subset P C R that contains precisely one 
element from each class of mutually associated prime elements. In this way, 
prime factorizations in R take the form 


Q=e Ile. 


peP 


where now ¢ € R* and the exponents v,(a) € N are unique. Of course, we have 
V,(a) = 0 for almost all p € P, so that the product is actually finite. For the 
ring of integers Z it is common to define P as the set of all (positive) prime 
numbers, while in the polynomial ring K LX] over a field kK one takes for P the 
set of all monic irreducible (or prime) polynomials, i.e., the set of all irreducible 
polynomials whose leading coefficient is 1. 

In the following we want to discuss the notions of the greatest common 
divisor and the least common multiple, which are often used within the context 
of unique factorization domains. To do this, consider an integral domain R and 


fix elements x1,...,2%, € R. An element d € R is called a greatest common 
divisor of @1,..., Xp if: 

(i) dla; for? =1,...,n, ie., dis a common divisor of all the 2;. 

(ii) If a € R is a common divisor of the aj, i-e., if a|a; fori = 1,...,n, then 
ald. 

Such a greatest common divisor d is unique up to associatedness, if it exists, 
and we will write d = gced(a,,...,v,). If d= 1, then x,,...,2, are said to be 
coprime. 

An element uv € R is called a least common multiple of 21,...,%n if: 

(i) a;|v for? =1,...,n, ie, v is a common multiple of all the x;. 

(ii) If a € Ris a common multiple of the x;, ie., if 2; |a for 7 = 1,...,n, 
then v|a. 

Similarly as before, such a least common multiple v is unique up to asso- 
ciatedness, if it exists, and we write v = lcm(21,...,2,). As usual, one proves 


the following result: 


Proposition 12. Let R be a unique factorization domain and let P be a system 
of representatives of all prime elements in R. If 


a, =e: || pe, § = yee eng 
pEP 
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are prime factorizations of elements x1,...,%, € R, then ged(a1,...,%,) and 
lem(21,...,2) exist and are given by 
ged(z1, coer) Ln) _ (eee 
peEP 
lem(21, ee In) = eee, 
pEeP 


up to associatedness. 


In principal ideal domains the greatest common divisor and the least com- 
mon multiple can be characterized in terms of ideals: 


Proposition 13. Let 11,...,2%, be elements of an integral domain R. 

(i) If (a1,...,2%n), the ideal generated by the x; in R, is principal, say gen- 
erated by an element d € R, then d= gcd(ay,..., Up). 

(ii) If (a)... .A(an) ts a principal ideal, say generated by an element v € R, 
then v = lem(a1,...,2n). 


Proof. (i) Assume (21,...,%,) = (d). Then we get x; € (d) and therefore d| x; 


for all t. Moreover, due to d € (2,...,%,), there is an equation d = 7", a;x; 
with suitable coefficients a; € R. In particular, every common divisor of the 2; 
is a divisor of d as well, which shows that d= gced(a1,..., 2p). 


(ii) Assume ()j_,(a;) = (v). Then v belongs to each of the ideals (a;) and 
hence is a common multiple of all the x;. Now if a is another common multiple 
of the 2;, we have a € (2;) for all i and hence a € ();_, (xi) = (v). This means 
that v|a, and we get v = lem(a1,...,2n). 


The above characterization of the greatest common divisor and the least 
common multiple in terms of ideals can be used to derive the following special 
version of the Chinese remainder theorem 2.3/12: 


Corollary 14. Let R be a principal ideal domain and a = ep}! ... pk" a prime 
factorization of some element a € R, where € is a unit and the prime elements 
Dp; are pairwise nonassociated. Then the Chinese remainder theorem 2.3/12 pro- 
vides a canonical isomorphism 


R/(a) = R/ (pi) , < Rf pe). 


Proof. Using Proposition 13 in conjunction with Proposition 12, the ideals 
(p{'),---,(py") are pairwise coprime in R, since ged(p;",p/’) = 1 for 7 # 7. 
Likewise, we have (a) = ();_, (pi), since a = Icm(py', .. . , p¥r). 


For Euclidean domains R, there exists a constructive process to determine 
the greatest common divisor of two given elements x,y € R, the so-called Eu- 
clidean algorithm. Applying this algorithm iteratively and using relations of 
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type gcd(x, y, z) = ged(ged(z, y), z), the process can even be used to determine 
the greatest common divisor of any number of elements in R. 


Proposition 15 (Euclidean algorithm). Let R be a Euclidean domain. For two 
elements x,y € R—{0} consider the sequence z, 21,... € R, which is inductively 
given by 


20 = 2, 
A=Y; 
i remainder of z;_, with respect to division by 2; if z; 4 0, 
S41 = 


0 otherwise. 


Then z; = 0 for almost all i € N. Furthermore, 2, = gcd(x,y), where n € N is 
the smallest index satisfying Zn41 = 0. 


Proof. Let 6: R—{0} —>+ N be the Euclidean function considered on R and fix 
an index i > 0 such that z; 4 0. According to the definition of the sequence 
20, 21,---, there is an equation of type 


2-1 = G2 + Zi41, 


where 6(z:41) < 6(z) or 2%+1 = 0. Therefore, the sequence of integers 6(z;) is 
strictly decreasing for i > 0, at least as long as z; is nonzero, and thus 6(z;) is 
defined. In particular, z; will be nonzero only for finitely many indices 7 € N, 
and there is a smallest index n € N such that z,4, = 0. Then n > 0, since 
zy #0 2. Now consider the equations 


(Eo) 2% =q% + 22, 
(En—2) 2n—-2 = In-1%n-1 + Zens 
(En-1) 2n-1 = dn2n- 


We get Zn | 2n-1 from (Ep_1), and furthermore 2, | z,-2 from (E,_2), and so 
forth, until we end up with z, | 2: and z,| 2. In particular, z, is a common 
divisor of x and y. If a € R is another common divisor of x and y, we get a| z2 
from (Ey), and furthermore a| z3 from (E,), and so forth, until we finitely obtain 
a| Zn. Hence, Zp is the greatest common divisor of x and y, as claimed. 


Not only does the Euclidean algorithm make it possible to determine the 
greatest common divisor d of two elements x,y in a Euclidean domain; beyond 
this, it yields a representation of this divisor as a linear combination d = ax+by. 
Indeed, in the above proof we get from (E,,») a representation of d = z, as a 
linear combination of z,_2, 2,1, and furthermore using (E,_3), as a linear com- 
bination of 2,3, Zn-2. Continuing like this, (Eo) finally leads to a representation 
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of d as a linear combination of x = z and y = 2. Let us add that such a rep- 
resentation is needed when one is explicitly solving simultaneous congruences; 
see 2.3/13 and the explanations following it. Of course, the mere existence of 
such solutions was already established in Proposition 13 within the context of 
general principal ideal domains. 

Finally, let us refer to some applications of the results dealt with in the 
present section. We can once more conclude from 2.3/8 and Proposition 6 that 
the residue class ring Z/pZ for an integer p € Z, p > 0, is a field if and only 
if p is a prime number. Likewise, for a field K and a polynomial f € K[X], 
the residue class ring L = K[X]|/(f) modulo the principal ideal generated by 
f is a field if and only if f is irreducible. Furthermore, it is easily seen (cf. 
the proof of 3.4/1) that the residue class of X in L becomes a zero of f. Just 
view Ky as a subfield of Z via the canonical homomorphism kK —> L (which 
is injective by 2.3/3) and, similarly, f as a polynomial with coefficients in L. 
Later, in 3.4/1, we will use this construction, which goes back to L. Kronecker, 
in order to construct, for a given polynomial f € K[X]—K that does not admit 
zeros in K, an extension field Z such that f acquires a zero in L. To give some 
simple examples, consider the canonical isomorphism 


R[X]/(X?+1)<C, 


obtained by applying the fundamental theorem on homomorphisms to the sub- 
stitution homomorphism 


R[X] — C, So an X" > Sani”, 
which maps X to the complex number 7. In a similar way one shows that 
R[X]/(X — a) ~R 


for arbitrary a € R. 


Exercises 


1. Determine all rings R such that the polynomial ring RLX] is a principal ideal 
domain. 


2. For principal ideal domains, we can conclude from Proposition 13 that the greatest 
common divisor as well as the least common multiple of two elements can be 
characterized in terms of ideal theory. Check whether the same is true for unique 
factorization domains. 


3. Prove that the subring R = Z+./—5-Z C C is not a unique factorization domain. 
To do this, consider the factorizations 6 = 2-3 = (1+ /—5)- (1 — /—5) and 
show that the elements 2, 3, (1 + /—5), (1 — /—5) are irreducible and pairwise 
nonassociated. Check whether the stated elements are prime. 


4, Let K be a field and R = K[X][Y]/(X? — Y?) the integral domain from Ex- 
ercise 8 in 2.3. Show that the residue classes X and Y of X,Y € K[X][Y] are 
irreducible, but not prime. 
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5. Let G be a cyclic group of finite order. Show for elements a,b € G that the 
subgroup generated by a and 6 in G is of order lcm(ord a, ord b). 


6. Show that 2 = (1 + 7)(1 — 2) is the prime factorization of 2 in Z[i]. 


7. Use the Euclidean algorithm to determine the greatest common divisor of the 
following polynomials in Q[X]: 


fH=XP4 X74 X-3, g = X®— X° 4 6X? —13X +7. 


8. Determine all irreducible polynomials of degree < 3 of the polynomial ring F2[X], 
where F9 is the field consisting of two elements. 


9. For a prime number p € N, consider the following subset of the field Q of rational 
numbers: 


Zp := {0} U iF € Q; x,y € Z—{0} such that v,(x) — vp(y) > of. 


Show that Z, is a subring of Q, a principal ideal domain, but not a field. Specify 
all units as well as all prime elements of Zp. 


10. Show that a ring R is Noetherian (in the sense that every ascending chain of 
ideals ay C ag C ... C R becomes stationary) if and only if every ideal in R 
admits a finite system of generators. 


2.5 Polynomial Rings in Several Variables 


In 2.1 we introduced the polynomial ring R[LX] in one variable X over a ring R. 
Iterating the construction, we could define the polynomial ring in n variables 
X1,...,Xp over R: 


RX =o (BRO RS)) JOEL. 


However, a more elegant way is to generalize the definition of 2.1 such that it 
applies to the case of several variables. In fact, we will define for a commutative 
monoid M the “polynomial ring” R[M] in such a way that we can interpret 
M as the (multiplicative) monoid of all “monomials” in R[M]. In doing so, 
the polynomial ring R[_X] in one variable X is obtained by taking M =N, the 
polynomial ring R[X),...,X,] in n variables Xy,...,X, by taking M = N”, 
and the polynomial ring R[X] in a family of variables = (X;)icr, indexed by 
an arbitrary index set I, by taking M = NY). In each case we take on N, N", 
and N“) the (componentwise) addition as the law of composition. 

In the following let M be an arbitrary commutative monoid whose law of 
composition is written additively. Then we define R[M] by 


RUM] = RO) = {(dy)yem 3 4, € R, a, = 0 for almost all py}, 
together with the laws of composition given by 


(Gy) wem + (Oy) wem = (a, + bu) peM (Qu) wem . (Oy) wem = (Cu) we, 
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where 


Cy = S> ay: by. 


At+v= 


It is verified without difficulty that R[1/] becomes a ring under these laws. 
In particular, for the monoid of natural numbers M = N we rediscover the 
polynomial ring RLX] in one variable X, as defined in 2.1. However, also in the 
remaining cases we can use a polynomial notation for the elements of R[M]. 
Indeed, for x € M consider X“ := (6,,.)yem as an element of R[M], where 6,,y 
is Kronecker’s symbol, which is given by 6,,, = 1 for 4 = A and 6,,, = 0 for 
pu # A. We call X" the monomial in R[M] that is attached to yw. Using this 
notation, the elements of R[M] can be written as sums of type ae, ayX", 
where the coefficients a, € R are unique and, of course, zero for almost all 
uu € M. Just as for polynomials in one variable X, addition and multiplication 
are expressed by the well-known formulas 


So ay X*+ S> bX" = So (ay + by) X", 


eM eM pweM 
S> aX". Sa, X" = 5 ( \~ ay by )X*, 
weM pweM HEM A+v=p 


As usual, the zero polynomial 0 = pare yO: X* serves as the zero element, 
and likewise, X° serves as the unit element of R[M], where the exponent 0 
indicates the neutral element of the monoid M. Also note that R is naturally 
a subring of R[M]. Just identify the elements a € R with their corresponding 
“constant” polynomials aX°. The polynomial ring R[M] admits the following 
universal property: 


Proposition 1. Let y: R — R’ be a ring homomorphism and a: M —> R' a 
monoid homomorphism, where we view R! as a monoid with respect to the ring 
multiplication. Then there exists a unique ring homomorphism ®: R[M] —> R’ 
satisfying ®|p = yp and ®(X") = o(u) for all we M. 


Proof. To verify the uniqueness assertion, consider an element Dae uw GX" in 
R[M]. If there exists a homomorphism @ satisfying the stated conditions, we 
must have 


(>. a,X") = S~ (a, X") = S~ B(a,)B(X") = > vlay)o(u)- 


Conversely, to establish the existence we can define @ via the preceding equation. 
The properties of a ring homomorphism are easily checked; just use the facts 
that y is a rmg homomorphism and a a monoid homomorphism. 


The property of polynomial rings proved in Proposition 1 is called a univer- 
sal property, since R[M] thereby appears, so to speak, as a master object from 
which all similar constructs combining coefficients in R and monomials in M 
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are derived via homomorphisms. In particular, the universal property uniquely 
characterizes R[M] up to canonical isomorphism. In more detail, this means the 
following. Start out from a ring extension RC S and a monoid homomorphism 
ut: M —+ S, where S' is viewed as a monoid under the ring multiplication, and 
assume that the mapping property stated in Proposition 1 is given, i.e., that 
for each ring homomorphism w: R —> R’ and each monoid homomorphism 
7: M —+ R' with R’ as multiplicative monoid, there is a unique ring homo- 
morphism YW: S —> R’ such that Y|p = w and Wo. =7T. Then the extensions 
RC R[M] and RC 8S are canonically isomorphic. 

We want to briefly justify this, using the general argument that applies 
to any universal property. If we consider the homomorphisms R @ S and 
ut: M —+ S, the universal property of R[M] yields a ring homomorphism 
&: R[{M] —> S that extends the identity on R and furthermore, satisfies 
&(X") = i(y) for all w € M. On the other hand, by the universal property 
of S, the monoid homomorphism M —> R[M], » > X*, leads to a ring 
homomorphism W: S —+ R[M] extending the identity on R and satisfying 
W(i(u)) = X* for all w € M. Hence, 60 W and the identity map constitute 
two ring homomorphisms S —+ S extending the identity on R and leaving ¢(,1) 
fixed for all uw € M. Now the uniqueness part of the universal mapping property 
on S yields 60 W = id and likewise W o @ = id, using the same property on 
R[M]. It follows that @ and W are isomorphisms. 

Now we want to consider the cases M = N” and M = NY), thereby looking 
at polynomial rings in the stricter sense of the word. First let M = N”. We 
define the ith “variable” X;, 1 <i <n, by X@--010-.% where the symbol 1 of 
the exponent is placed at position i. Then, for = (f41,...,f4n) € N” we have 
X" = Xi"... X#", and the elements of R[N"] can be written in a more explicit 
way as sums: 

S ieee 


(H1y-++sfn JEN” 


where the coefficients a,,,.4., € R are unique and, of course, zero for almost 
all indices (f1,.-., Mn). Instead of R[N"] we use the notation R[X,,...,Xn] 
or R[X] and view X = (Xj,...,X,) as a family of variables. Similarly, we 
proceed with monoids of type M = N“), where J is an arbitrary index set. Let 
e; for i € I specify the element of NY) whose components are all 0, except for 
1 at position 7. Then, setting X; = X*, i € I, we have X* = [J,.,X/" for 
Le = (ta)ier € NM. Note that almost all factors of such a product are trivial, so 
that it is actually a finite product. In particular, the elements of RIN] can 


be written as sums of type 
d, mT] x" 


peND ier 


with coefficients a, € R that are unique. Instead of RIN] we also use the 
notation R[X;; 7 € I] or R[X] for ¥ = (X;);er. The elements of R[X] are 
polynomials in finitely many variables X;,,...,X;,, and we can view R[X] as 


the union of all subrings of type R[X;,,...,X;i,,], where the set {i,...,in} 


54 2. Rings and Polynomials 


varies over all finite subsets of J. In particular, computations involving only 
finitely many elements of R[X] can always be carried out in a polynomial ring 
in finitely many variables. 

For simplicity, we will restrict ourselves in the following to the case of poly- 
nomial rings of type RLX1,..., Xn], although the results we prove below remain 
valid mutatis mutandis also for arbitrary sets of variables. Let us add that ba- 
sically, polynomial rings in infinitely many variables will be used only for the 
construction of algebraically closed fields in 3.4. Furthermore, observe, either 
by direct computation or by applying Proposition 1 (see also Exercise 3), that 
there are canonical isomorphisms of type 


AUR c2a5 Ke) (RL a ee) 


for n > 0, using the convention R[X1,...,Xn-1] = R for n = 1. These isomor- 
phisms allow one in many cases to inductively reduce problems on polynomials 
in several variables to the case of one variable. 


Proposition 2. [f R is an integral domain, then for finitely many variables 
X1,..-,Xn, the polynomial ring RLX1,...,Xn] is also an integral domain. 


Proof. We have already seen in 2.1/3 that the proposition is true in the case of 
one variable. But then, using the isomorphism 


PKG cca Me) CR Raga Maa) Beals 


the general case follows by induction. 
Alternatively, one can use a direct argument to show that the product of 
two nonzero polynomials 


f= 3G", g@=y bX” CRG] 


is nonzero if R is an integral domain. Indeed, introduce the lexicographic order 
on N", i.e., we write yw < p’ for indices 


= (Miy---y bm), = (Wyn) EN", 
if for some i, 1 <i < n, we have 
/ / / 
Hi = fy, vey Mind = Hg Mi S Me 


Now choose 7 € N maximal (with respect to the lexicographic order) among all 
psuch that a, A 0, as well as 7 maximal such that b, 4 0. Then the coefficient of 
the monomial X"*” in fg equals azby. In particular, if R is an integral domain, 
we have agby # 0 and hence fg ¥ 0. 


Given an index pp = (t1,---,Un) € N”, we write |u| := fi +... + fl, and 
call this number the degree of . Furthermore, for a polynomial f = >) a,X" 
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in R[LX1,...,Xn], we call f; = yujni uX" for i € N the homogeneous part 
of f of degree i. In particular, f may be interpreted as the sum of its homo- 
geneous parts, i.e., f = S07, fi. We call f homogeneous if f equals one of 
its homogeneous parts or, more precisely, homogeneous of degree i if f = fj. 
A homogeneous polynomial f 4 0 is always homogeneous of a unique degree 
i > 0, whereas the zero polynomial is homogeneous of every degree i > 0. 
Furthermore, 


deg f = max{i EN; f; #0} = max{|pl ; a, 4 Of 


is called the total degree of f, with the convention that deg f := —oo for f = 0. 
Note that the total degree of a polynomial in a single variable coincides with 
the degree as defined in 2.1. Moreover, there is the following analogue of 2.1/2: 


Proposition 3. Let f,g © R[X,...,Xn] be polynomials with coefficients in a 
ring R. Then 


deg(f +g) < max(deg f, deg g), 
deg(f - g) < deg f + deg g, 


and deg(f -g) = deg f + deg g if R is an integral domain. 


Proof. The estimate for deg(f + g) becomes clear if we decompose polynomials 
in R[X,,...,X,] into the sums of their homogeneous parts. Furthermore, if 
deg f = r and deg g = s, andif f = )7;_, fi, g = )o}-9 gi are the decompositions 
into homogeneous parts, we obtain, assuming r,s > 0, 


f-9= f+ 9s + (homogeneous terms of degree <r +s), 


where f,.-gs equals the homogeneous part of degree r+s in f-g. This shows that 
deg(f +g) < deg f + deg g. If R is an integral domain, then f,,g, 4 0 implies 
f-9s £9, due to Proposition 2, so that the degree of fg is r+ s. 


Corollary 4. If R is an integral domain, then 


(A Suiegkaly = RS 


Next, we want to adapt the universal property of polynomial rings, which 
uniquely characterizes these rings up to canonical isomorphism, especially 
to polynomial rings of type R[X1,...,Xn]. Since a monoid homomorphism 
a: N" —+ R’ is uniquely determined by the images of the canonical “genera- 
tors” of N", namely of the elements of type (0,...,0,1,0,...,0), we can derive 
the following version of Proposition 1: 


Proposition 5. Let 9: R —> R’ be a ring homomorphism and consider finitely 
many elements 11,...,Un € R’. Then there exists a unique ring homomorphism 
®: R[X,...,Xn] — R’ satisfying ®|p = y and O(X;) = x; fori =1,...,n. 
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Writing x = (@,...,%») and r# = ai'!... aH for « € N” in the situation of 
the preceding proposition, we can describe the homomorphism @ by 


®: R[X,...,Xn] +R, Yo aX" DY play)", 


similarly as in the case of a single variable. We call ® a substitution homomor- 
phism, since the tuple x is substituted for X. In particular, if R is a subring of R’ 
and y: R& R’ the canonical inclusion, then the image under @ of a polynomial 
f= Ma,.X" € RLM,...,X,] will generally be denoted by f(x) = Yo a,a". If 
f(x) = 0, we call x a zero of f. Moreover, we use the notation 


R[x] = O(R[X,...,Xp]) = {So a,c" a, € R, a, =0 for almost all ys} 


for the image of R[Xj,...,X,] with respect to @. Then R[x], or in more 
explicit terms R[21,...,2n], is the smallest subring of R’ that contains R and all 
components 21,...,2p of x. Suggestively, we call R[a] the ring of polynomials 
in x (or better, of all polynomial expressions in x) with coefficients in R. 

Substitution homomorphisms will play an important role later on. As a 
typical example, let us mention the notion of transcendence. 


Definition 6. Let RC R’ be a ring extension and « = (a1,...,%p) a system 
of elements in R'. Then x is called algebraically independent or transcenden- 
tal over R if for a system of variables X = (X1,...,X,) the ring homomor- 
phism RLX] —> R', f —> f(a), is injective and thus induces an isomorphism 
R[X] — R[2]. Otherwise, x is called algebraically dependent. 


In particular, any system x = (21,...,%,) that is transcendental over R, 
admits the same properties, as does a system of variables. For example, we have 
already pointed out in the introduction that each of the numbers e and 7 € R, 
well-known from analysis, is transcendental over Q; proofs for this fact go back 
to Ch. Hermite [8] and F. Lindemann [13]. 

Finally, let us refer to the reduction of coefficients of polynomials, a pro- 
cess that, formally speaking, belongs to the subject of substitution homomor- 
phisms as well. If a C R is an ideal and y: R —> R/a the canonical ho- 
momorphism, we can apply Proposition 5 and consider the homomorphism 
®: R[X] — (R/a) LX] that extends y and maps X to X. We say that & 
reduces the coefficients of polynomials in R[.X] modulo the ideal a. For exam- 
ple, the homomorphism Z[X] —> Z/(p)[X] for a prime number p transforms 
polynomials with integer coefficients to polynomials with coefficients in the fi- 
nite field F, = Z/(p). 


Exercises 


1. The polynomial ring R[M] with coefficients in a ring R has been defined for 
commutative monoids M. If we want to extend the definition to not necessarily 
commutative monoids, which new phenomena have to be paid attention to? 
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2. Examine how far the results of the present section on polynomial rings in finitely 
many variables RL.X1,...,Xn] can be generalized to polynomial rings in arbitrary 
sets of variables R[X]. 


3. For two monoids M, M"', consider their Cartesian product M x M’' as a monoid 
with componentwise law of composition. Show that there is a canonical ring iso- 
morphism R[M][M'] > R[M x M’). 


4. Let R be a ring. Consider Z, as well as Z/mZ for m > 0, as monoids with respect 
to addition and show that 


R[Z] ~ R[X,Y]/(1-— XY), R[Z/mZ] ~ R[X]/(X™- 1). 


5. Let K be a field and f € K[Xj,...,Xn] a homogeneous polynomial of total 
degree d > 0. Show that for every prime factorization f = p,...p,, the factors p; 
are homogeneous as well. 


6. Consider the polynomial ring R[X,...,X;] in n variables over a ring R 4 0 and 
show that the number of monomials of total degree d € N in R[X1,..., Xn] is 


n+d-1 
n-1 /° 
7. Let K bea field and y: K[Xq,...,Xm] —> K[X,...,Xn] a ring isomorphism 
such that y|K = idx. Show that m = n. 
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Let K be a field and f € K[X] a nonzero polynomial in a variable X. Then, if 
a € K isa zero of f, the linear polynomial X — a divides f. Indeed, Euclid’s 
division of f by X — a leads to an equation 


f=q:(X-a)+r, 


where degr < 1 and hence r € K. Furthermore, substituting X by a yields 
r = 0. We say that a is a zero of multiplicity r if X — a appears in the prime 
factorization of f with a power precisely r. Therefore, looking at degrees, we 
can assert the following: 


Proposition 1. Let K be a field and f € K[X] a polynomial of degree n > 0. 
Then, counting multiplicities, f admits at most n zeros in K. The number of 


these zeros is precisely n if and only if all factors of the prime factorization of 
f in K[X] are linear. 


In particular, we thereby see that a polynomial of degree < n for some n € N 
equals the zero polynomial as soon as it has more than n zeros. Therefore, if 
K is an infinite field, the equality f = 0 (zero polynomial) is equivalent to the 
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fact that f(a) = 0 for all a € K (resp. for all a from a given infinite subset of 
kK). On the other hand, for a finite field F, the polynomial 


f=[[%-o €FLX] 


acF 


is nonzero and satisfies f(a) = 0 for all a € F. 
There is a simple criterion for the existence of multiple zeros. To formulate 
it consider the map 


D: K[X] > K[X],  SoaXtro Wie x™, 
i=0 i=l 


which is defined just like the usual differentiation of real polynomials (interpret 
ic; as the i-fold sum of c; with itself). Note that D is not a ring homomorphism; 
it is a so-called derivation, i.e., D satisfies the following rules for a,b € K, 
fg € KX]: 


Daf +bg)=aD(f)+bD(g), Di fg) = fD(g) + gD(f). 


We will mostly write f’ instead of Df, calling this the first derivative of f; see 
also 7.4. 


Proposition 2. Let f €¢ K[X], f £0, be a polynomial with coefficients in a 
field K. A zerow€ K of f is a multiple zero (i.e., a zero of multiplicity > 2) 
if and only if (f’)(a) = 0. 


Proof. If a is a zero of f of multiplicity r > 1, then there is a factorization of 
type f = (X —a)"g for some g € K[X] satisfying g(a) 4 0. Since 


fi =(X-a)’g +r(X — alg, 


we see that (f’)(a) = 0 is equivalent to r > 2. 


Corollary 3. An element a € K is a multiple zero of a nonzero polynomial 
f € K[X] if and only if a is a zero of gcd(f, f’). 


For example, if p is a prime number, the polynomial f = X? — X € F,[X] 
does not admit multiple zeros. Indeed, we have f’ = —1, since the p-fold sum 
p-1of the unit element 1 € F, = Z/pZ is zero. 


Exercises 


1. Let K be a field consisting of infinitely many elements and f € K[X1,...,Xn] 
a polynomial vanishing at all points of K”. Show that f = 0, i.e., f is the zero 
polynomial. 
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2. Let K be a field. Show for n € N, n > 1, that the multiplicative group K* 
contains at most n — 1 elements of order n. 

3. Let K be a field. Show that the polynomial ring A[_X] contains infinitely many 
monic prime polynomials. Furthermore, if each nonconstant polynomial from 
K(X] admits at least one zero in K, show that K consists of infinitely many 
elements. 

4, Let K be a field and let f = X°+aX +b€ K[X] bea polynomial admitting a 
factorization into linear factors in KX]. Show that the zeros of f are distinct if 
and only if the “discriminant” A = —4a°® — 27)? is nonzero. 


2.7 A Theorem of Gauss 


The purpose of the present section is to prove the following basic result on 
unique factorization domains: 


Proposition 1 (Gauss). Let R be a unique factorization domain. Then the 
polynomial ring in one variable RX] is also a unique factorization domain. 


There are some immediate consequences: 


Corollary 2. [f R is a unique factorization domain, then the same is true for 
the polynomial ring RLX1,...,Xn]- 


Corollary 3. For any field K, the polynomial ring K[.X1,...,Xn] ts a wnique 
factorization domain. 


In particular, there exist unique factorization domains that are not principal 
ideal domains; just look at the polynomial ring K [X,Y] in two variables X,Y 
over a field A’, or at the polynomial ring in one variable Z[_X] over the ring of 
integers Z. 

For the proof of Proposition 1 we need some preparations. To begin with, 
we construct the field of fractions Q(R) of an integral domain R, taking the 
construction of rational numbers in terms of fractions of integers as a guide. 
Therefore, consider the set of pairs 


M = {(a,b); aE R, be R—{O}} 


it rl 


and define an equivalence relation “~” on it by setting 
(a,b)~ (a0) <=> ab =d. 
The conditions of an equivalence relation are easily checked, namely 
reflexivity: (a,b) ~ (a,b) for all (a,b) € M, 
symmetry: (a,b) ~ (a’,b') => (a’,b') ~ (a,b), 
transitivity: (a,b) ~ (a’,b’), (a’,b') ~ (a”,b") => (a,b) ~ (a",b"). 
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For example, to justify the transitivity we argue as follows: 


(a,b) ~ (a’,0) ab! = a'b ab'b" = a'bb", 
(a’, b’) Pe (a”, b”) a'b" ab! a’ bb" _ al bb’, 


so that 
(a,b) ~ (a’,0'), (a, 0) ~ (a", 0") => ad" = abl’. 
However, the last equation yields ab” = a’b, and thus (a,b) ~ (a",b”), since R 
is an integral domain. 
Now observe that the equivalence relation 
equivalence classes; let 


“6 


~” defines a partition of M into 


Q(R) = M/~ 


be the corresponding set of classes. The equivalence class attached to a pair 
(a,b) € M is denoted by ¢ € Q(R), using the notion of fractions. Observe that 
due to the definition of “~”, we have 


; 5 ab! =a'b. 
It is easily checked that Q(R) is a field under the addition and multiplication 


of fractions 


i a a a aa’ 
b 6D by)’ b OU bb’? 
where one shows as usual that the laws “+” and “-” are well defined. We call 


Q(R) the field of fractions of R. Furthermore, 


R—Q(R), aro >: 
is an injective ring homomorphism, which allows us to view R as a subring of 
Q(R). For example, taking R = Z, the associated field of fractions is Q(Z) = Q, 
the field of rational numbers. If K is a field and _X a variable, the field of fractions 
Q(K[X]) is called the field of rational functions in the variable X with coeffi- 
cients in A’, and is denoted by K(X) = Q(K[X]). Analogously, one considers 
rational function fields K(X1,...,Xn) = Q(K[X1,...,Xn]) in finitely many 
variables X,,...,X;, and, more generally, function fields K(X) = Q(AK[X]) in 
a system of variables ¥ = (X;)jer. 

The above construction of fields of fractions attached to an integral domain 
extends to a more general setting. Fixing a ring R (that may have nontrivial 
zero divisors), consider a multiplicative system S C R, ie., a multiplicative 
submonoid of R. Then, similarly as before, we can define the ring of fractions 
(in general it will not be a field) 


str={;aeR, sesh, 


where, due to possible nontrivial zero divisors in R, one works relative to the 
following equivalence relation: 
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a . 
—-=— <>) | there exists some s” € S such that as’s” = a'ss". 
8s 8 


We use Rg as a shorthand notation for S~'R and call this ring the localization 
of R by S. However, observe that the canonical map R —> $~!R might not 
be injective, since it might have a nontrivial kernel. The kernel consists of all 
elements a € R such that there exists an element s € S satisfying as = 0. If R 
is an integral domain (this is the main case to be considered in the following), 
we have Q(R) = S~!R, where S := R— {0}. 


Remark 4. Let R be a unique factorization domain, and P a system of repre- 
sentatives of the prime elements in R. Then every fraction ¢ € Q(R)* admits 


a unique factorization 
VL 
i =€ | | pe 


where € € R* and vy € Z with v, =0 for almost all p. In particular, 5 belongs 
to R if and only if v, > 0 for all pe P. 


Proof. Using the prime factorizations of a and b, the existence of the stated fac- 
torization follows. Furthermore, the uniqueness is a consequence of the unique- 
ness of prime factorizations in R, at least if one considers factorizations of ¢ 
satisfying v, > 0 for all p. However, we can reduce to this case by multiplying 
; by suitable nonzero elements of R. 


If we have « = ; in the situation of Remark 4, we write more explicitly 
Vp(x) instead of vp, adding v,(0) := oo as a convention. Then the uniqueness 
assertion of Remark 4 implies 


Up(LY) = Up(x) + vp(y) 
for x,y € Q(R). Moreover, considering polynomials f = >> a;X' € Q(R)[X] in 
one variable X, we set 
vp f) = min v9(a), 
where f = 0 is equivalent to v,(f) = co (for some, and hence all, p € P). Also 
note that f belongs to R[X] as soon as v,(f) > 0 for all p € P. 


In order to show that the polynomial ring over a unique factorization domain 
is of the same type, we need a key fact about the function v,(-): 


Lemma 5 (Gauss). Let R be a unique factorization domain and p € R a prime 
element. Then v,(-) satisfies the following relation for elements f,g € Q(R)[X]: 


Up( £9) = Up(f) + Up(g). 


Proof. As mentioned above, the stated relation holds for constant polynomials, 
ie., for f,g € Q(R) and hence also for f € Q(R) and arbitrary polynomials 
g € Q(R)LX]. 
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To deal with the general case we may assume f,g # 0. Furthermore, due 
to the preceding consideration, we are allowed to multiply f and g by con- 
stants from Q(R)*. In particular, representing the coefficients of f as fractions 
of elements in R, we can multiply f by the least common multiple of all denom- 
inators. Proceeding in the same way with g, we are reduced to the case that 
f and g are polynomials with coefficients in R. Moreover, we can divide f by 
the greatest common divisor of all its coefficients, likewise for g, and thereby 
assume 


f,g € R[X], Up(f) = 0 = %(9). 
Then it remains to show that v,(fg) = 0. To do this, consider the homomor- 
phism 
®: REX] — (R/pR)[X] 
reducing coefficients. The kernel ker & consists of all polynomials in RX] whose 
coefficients are divisible by p, i.e., 


ker ® = {f € R[X]; v,(f) > Of. 


Since v,(f) = 0 = (9), we get O(f),O(g) # 0. Now R/pR is an integral 
domain, and the same is true for (R/pR)[X], by 2.1/3. Therefore, we conclude 
that 


P( fg) = Of) P(g) #0, 


and in particular, v»(fg) = 0. 


Corollary 6. Let R be a unique factorization domain and h € R[X] a monic 
polynomial. Assume that there is a factorization h = f-g into monic polynomials 


fig © Q(R)[X]. Then necessarily f,g € RLX]. 


Proof. We have v,(h) = 0, as well as (f),u)(g) < 0 for every prime element 
p € R, due to the fact that h, f, and g are monic. Furthermore, Gauss’s lemma 
yields 

Up(f) + Up(g) = Mp(h) = 0 


and hence v,(f) = v,(g) = 0 for all p. However, this means that f,g € RLX], 


Pp 
as claimed. 


A polynomial f € R[X] with coefficients in a unique factorization domain 
R is called primitive if the greatest common divisor of all its coefficients is 
1 or, equivalently, if v,(f) = 0 for all prime elements p € R. For example, 
monic polynomials in R[X] are primitive. Moreover, proceeding similarly as in 
the proof of Corollary 6, we can conclude for a polynomial h € R[X] and a 
factorization h = f +g with f € RLX] primitive and g € Q(R)[X] that g is 
already contained in R[X]. 

In the following we will frequently use the fact that every nonzero polyno- 
mial f € Q(R)[X] can be written as f = af with a constant a € Q(R)* and a 
primitive polynomial f € RLX]. Just set 
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a= Ilo: f = af, 


pEP 


where P is a system of representatives of the prime elements in R. 

Now we are able to prove the result of Gauss announced at the beginning 
of the present section. As a by-product, we will get a characterization of the 
prime elements in RLX]. 


Proposition 7 (Gauss). Let R be a unique factorization domain. Then the 
polynomial ring R[X] is a unique factorization domain as well. A polynomial 
q€ R[X] is prime if and only if: 

(i) q is prime in R, or 

(ii) g is primitive in RLX] and prime in Q(R)[X]. 

In particular, a primitive polynomial q € RLX] is prime in RX] if and 
only if it is prime in Q(R)[X]. 


Proof. Let q be a prime element in R. Then R/qR is an integral domain and 
the same is true for RLX]/qR[X] ~ (R/qR)[X]. From this we conclude that 
q is prime also in R[LX]. 
re consider a primitive polynomial q¢ € R[X] such that qg is prime in 
R)[X]. In order to show that q is prime even in RLX], consider polynomials 
. € R[X] such that g| fg in RLX]. Then we have q| fg in Q(R)[X] as well. 
Since q is prime in Q(R)[X], it divides one of the two factors, say q| f, and 
there exists h € Q(R)[X] such that f = gh. Now apply Gauss’s lemma to the 
latter equation. Since q is primitive, we get for every prime element p € R that 


O< Vy(f) = Vp(q) + Yp(h) = Yp(h), 


and hence h € R[X]. In particular, g| f in R[X], and it follows that qg is prime 
in R[X]. 

It remains to show that RLX] is a unique factorization domain and that 
every prime element in R[X] is of type (i) or (ii). To achieve this it is clearly 
enough to show that every nonzero nonunit f € RX] admits a factorization 
into prime elements of type (i) and (ii). Let us establish the latter fact. Write 
f=a f , where a € R is the greatest common divisor of all coefficients of f and 
hence f is primitive. Since a is a product of prime elements in R, it is enough 
to show that the primitive polynomial f is a product of primitive polynomials 
in R[X] that are prime in Q(R)[X]. Let f = cf... f, be a factorization into 
prime elements from Q(R)[X], for a constant c € Q(R)*. Choosing c suitably, 
we may assume that all i are primitive in R[X]. Then Gauss’s lemma implies 
for every prime element p € R that 


Vp(f) = vp(c) + vp(fa) +--+ ¥p(Fr), 


and since 
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that vp(c) = 0; this means that c is a unit in R. Now replacing fi by cf, we 
see that f is a product of prime elements of the desired type. 


Exercises 


1. Let R be a unique factorization domain and &: RLX] —> RLX] a ring auto- 
morphism that restricts to an automorphism yp: R —> R. Compare vp(f) and 
Vo(p)(O(f)) for polynomials f € R[X] and prime elements p € R, and check 
whether ®(f) is primitive when f is primitive. Show for a € R that a polynomial 
f is primitive if and only if f(X +a) is primitive. 


2. Consider a unique factorization domain R with field of fractions K and with a 
system P of representatives of its prime elements. For f © K[X]|—{0} denote by 
ap i= Ther p’?\D the “content” of f. Formulate the assertion of Gauss’s lemma 
(Lemma 5) in an equivalent way using the notion of content. 


3. Consider the rational function field A(X) in one variable X over a field K, as 
well as the polynomial ring K(X)[Y] for a variable Y. Let f(Y),9(Y) € K[Y] 
be coprime with deg f(Y)-g(Y) > 1. Show that f(Y) — g(Y)X is irreducible in 
K(X)[Y]. 


4. Let R be a unique factorization domain. Show: 


(i) For a multiplicative system S C R, the ring of fractions S~1R is a unique 
factorization domain again. How are the prime elements of R related to 
those of S~!R? 


(ii) For prime elements p € R set Rp := on where S;, = R-(p). A polynomial 
f € R[X] is primitive if and only if the induced polynomial f, € Rp[X] is 
primitive for every prime element p € R. 


5. Universal property of rings of fractions: Let R be a ring and S Cc Ra mul- 
tiplicative system. Show for every ring homomorphism y: R —> R’ satisfying 
v(S) Cc R™ that there exists a unique ring homomorphism G: S~'R —> R’ such 
that yp = Gor; here r: R —> S~!R denotes the canonical homomorphism given 
by a> ¢. 


6. Partial fraction decomposition: Let f,g € K[X] be polynomials with coefficients 
in a field K, where g is monic with prime factorization g = gj’... gin and pairwise 
nonassociated prime polynomials gi,...,9n- Show that in the field of fractions 
K(X) = Q(K[X]) there is a unique decomposition 


f= f+ yo 


j=l j=l Gi 


with polynomials fo,cij € K[X], where degcj; < degg;. In particular, if the 
prime factors g; are linear, the cj; are of degree 0 and thus are constant. Hint: 
Prove the existence of a decomposition fg~! = fo + 774 fig; such that git fi 
and deg f; < deg g/*. Then apply the g;-adic expansion to the fj; cf. Exercise 4 
from 2.1. 
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2.8 Criteria for Irreducibility 


Let R be a unique factorization domain and K = Q(R) its field of fractions. 
In the following we want to discuss some techniques for checking whether a 
given polynomial f € AK [X]—{0} is irreducible or prime, which are the same 
in unique factorization domains, due to 2.4/10. Depending on f, there is al- 
ways a constant c € K* such that f = cf is a primitive polynomial in R[X]. 
Furthermore, we can conclude from the result of Gauss 2.7/7 that f and f are 
irreducible in KX] if and only if f is irreducible in R[X]. In this way, the 
irreducibility of polynomials in AX] can be reduced to the irreducibility of 
primitive polynomials in RLX ]. 


Proposition 1 (Eisenstein’s criterion). Let R be a unique factorization domain 
and f = a,X"+...+a9 € R[X] a primitive polynomial of degree > 0. Assume 
there is a prime element p © R such that 


ptan, pla; for i<n, pt ao. 


Then f is irreducible in RLX] and hence, by 2.7/7, also in Q(R)[X]. 


Proof. Suppose f is reducible in RLX]. Then there is a factorization 


s 


f=gh, say se n= ye, 


i=0 i=0 


where r+s =n, andr > 0,s > 0. Furthermore, from our assumption on f, we 
conclude that 


an = byCs # 0, pt{b,, Ditx 
a = boco, plboco, Pp’ tboco, 


and we may assume p| bo, p{co. Now let t < r be maximal such that p|b, for 
O0<7<t. Setting b; = 0 fori >r and c; = 0 for i > s, we get 


Ge41 = boci41 +... + bp41€0, 


where a4, is not divisible by p. Indeed, boci41,..., bc; are divisible by p, due 
to the definition of t, and b:,1co is not. But then we must have t+ 1 =n, due 
to our assumption on f, and therefore r = n, s = 0, which is in contradiction 
tos >0. 


Next we want to discuss the reduction test for irreducibility. 


Proposition 2. Let R be a unique factorization domain, p © R a prime ele- 
ment, and f € RIX] a polynomial of degree > 0 whose leading coefficient is 
not divisible by p. Furthermore, let ®: RIX] —> R/(p)[X] be the canonical 
homomorphism reducing coefficients mod p. Then: 
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If ®(f) ts irreducible in R/(p)[X], then the same is true for f in Q(R)[X]. 
If in addition, f is primitive, then it is irreducible in RLX] as well. 


Proof. First, assume that f € R[LX] is primitive. Then if f is reducible, there is 
a factorization f = gh in RLX], where degg > 0 and degh > 0. Furthermore, 
p cannot divide the leading coefficient of g or h, since p does not divide the 
leading coefficient of f. Now we have 


where ®(g) and ®(h) are nonconstant polynomials in R/(p)[X], and it follows 
that &(f) is reducible. Thus, by contraposition, &(f) irreducible implies that f 
is irreducible in R[X]. 

To deal with the general case, write f = c- f for a constant c € R and 
a primitive polynomial f € R{X]; note that p cannot divide c or the leading 
coefficient of f. Then if @(f) is irreducible, the same is true for &( 7 ), and it 
follows, as we just have seen, that f is irreducible in R[X]. Applying the result 
of Gauss 2.7/7, we conclude that f and hence also f are irreducible in Q(R)[X]. 


Let us add that alternatively, Eisenstein’s criterion can be obtained as 
a consequence of the above reduction test for irreducibility. Indeed, let us 
place ourselves in the situation of Proposition 1. If there is a factorization 
f = gh for polynomials g,h € R[X] of degree < n, we can apply the reduc- 
tion homomorphism ®: RLX] —> R/(p)[X], thereby obtaining an equation 
G,X" = O(f) = O(g)P(h). We claim that &(g) and (h) are nontrivial powers 
of X, up to constant factors from R/(p). To justify the claim, interpret the 
preceding factorization in the polynomial ring k|X] over the field of fractions 
k of R/(p), which is a unique factorization domain. In particular, the constant 
parts of g and h will be divisible by p, and it follows that the constant part of 
f is divisible by p?, which is in contradiction to the assumption on f. 

We want to add some examples showing how to apply the above irreducibil- 
ity criteria: 


(1) Let & be a field and K := k(t) the field of rational functions in a 
variable t over k. Then the polynomial X" —t € KX] is irreducible for n > 1. 
Indeed, R := k[t] is a unique factorization domain with field of fractions kK’. 
Furthermore, ¢t € R is a prime element, and X” —t is a primitive polynomial in 
R[X] such that Eisenstein’s criterion can be applied for p := t. 


(2) Let p € N be a prime number. We claim that f(X) = X?-'+...+1 
is irreducible in Q[X]. To justify this we apply Eisenstein’s criterion to the 
polynomial f(X + 1), using the fact that f(X + 1) is irreducible if and only if 
the same is true for f(X). Note that 
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XP — 1 
(H=sS, 
—_ (X11 a p-l P p- Pp 
f(X +1) = = x +(B)x tee.t (22). 


The conditions of Eisenstein’s criterion are satisfied for f(X + 1), since we have 
( ig ) = pand p| C) for vy =1,...,p—1; just observe that 


- (7) -Rer ee 


Vy Let 


admits for v = 1,...,p — 1a prime factor p in the numerator, but not in the 
denominator and hence is divisible by p. 

(3) f = X° 4+ 3X? — 4X —1 is irreducible in Q[X]. Indeed, view f as a 
primitive polynomial in Z[X] and reduce coefficients mod 3. It remains to show 


that the polynomial 
X8—-X-1¢€F3[X] 


is irreducible. This is easily checked, since the polynomial does not admit zeros 
in F3. More generally, one can show (cf. Exercise 2 below) that the polynomial 
X? — X — 1 is irreducible in F,[X] for every prime number p. 


Exercises 


1. Show that the following polynomials are irreducible: 
(i) X443X34 X2-2X +1€ Q[X]. 
(ii) 2X4 + 200X3 + 2000X? + 20000X + 20 € QLX]. 
(iii) X°Y + XY? -X -Y+1¢€Q[X,Y]. 


2. Let p € N be a prime number. Show that the polynomial g = X? — X — 1 is 
irreducible in F,[X]. Hint: Note that g is invariant under the automorphism 
tT: F,[X] — F,[X], f(X) +> f(X 4+ 1), and study the action of 7 on the 
prime factorization of g. 
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In the present section we want to generalize the concept of vector spaces over 
fields to modules over rings, our main objective being modules over principal 
ideal domains. For example, any abelian group can be viewed as a Zmodule, 
i.e., as a module over the ring Z. In any case, the study of abelian groups, 
in particular the classification of finitely generated abelian groups, is a good 
motivation for developing the theory of elementary divisors, up to its central 
result, the structure theorem for finitely generated modules over principal ideal 
domains. This theorem contains the classification of finitely generated abelian 
groups as a special case and, beyond this, admits other interesting applications, 
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such as the existence of canonical forms for endomorphisms of finite-dimensional 
vector spaces; cf. Exercise 3. In the following we will prove the so-called elemen- 
tary divisor theorem, which clarifies the structure of submodules of finite rank 
in free modules over principal ideal domains. As a corollary, we derive the just 
mentioned structure theorem for finitely generated modules over principal ideal 
domains. 

Let A be a ring, general for the moment, but later assumed to be a principal 
ideal domain. An A-module is an abelian group M, together with an exterior 
multiplication 

Ax M—> M, (a,u) > a-2, 


that satisfies the usual “vector space axioms” 
a:(a+y)=a-r+a-y, 

(a+b)-a=a-4+b-a, 

a-(b-x) = (ab)-x 


le 2= 2 


d 


? 


for elements a,b € A, x,y € M. Homomorphisms between A-modules, also 
referred to as A-homomorphisms, are defined just as in the context of vector 
spaces, likewise submodules of an A-module M, as well as the residue class mod- 
ule M/N of an A-module M by a submodule N. Furthermore, the fundamental 
theorem on homomorphisms 1.2/6 remains valid in the module context. If we 
consider A as a module over itself, the ideals of A coincide with the submodules 
of A. Moreover, for any ideal a C A we can view the residue class ring A/a as 
an A-module. 

As already mentioned, every abelian group G can naturally be viewed as 
a Z-module. Just define the product map Z x G —> G, (a,x) -—> ax, by 
az = >i, 2 for a > 0 and ax = —(—a)x for a < 0. On the other hand, 
every Z-module M gives rise to an abelian group G by forgetting about the 
Z-multiplication on M. It is easily verified that in this way, abelian groups and 
Z-modules correspond bijectively to each other and that the correspondence 
extends to homomorphisms, subgroups, and submodules, as well as to residue 
class groups and residue class modules. To give another example, consider a 
vector space V over a field AK, together with a K-endomorphism y: V — V. 
Then V becomes a module over the polynomial ring in one variable K LX] if 
we define the multiplication by 


K[X]xV—V, os aiX',v) r+ So ay'(v). 


On the other hand, for every K[X]-module V we can consider its underlying 
k-vector space together with the K-endomorphism y: V —> V that is given 
by multiplication by X. Also in this case, pairs of type (V,v), consisting of a 
k-vector space V and a K-endomorphism y: V —> V, correspond bijectively 
to K LX ]-modules. 
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For a module M and a family of submodules M; C M, i € I, their sum is 
defined as usual by 


M' =~ M,= Se x; € M,, x; = 0 for almost all i € rh. 


iel ier 


If every x € M’ admits a representation x = ve 7% With elements 7; € M; 
that are unique, we call M’ the direct sum of the Mj, writing M' = @,.,; Mi in 
this case. For example, a sum Mj, + M2 of two submodules of // is direct if and 
only if M, 7 Mz = 0. Furthermore, given a family of A-modules (M;);e7, we can 
naturally construct an A-module M that is the direct sum of the M;. Indeed, 
let 
M= { (#iier € [] Mi; 2; =0 for almost alll i} 
ier 

and identify M/; in each case with the submodule of M consisting of all families 
(xy )iver, Where xy = 0 for i! 4 i. 

A family (2;)ier of elements of an A-module M is called a system of genera- 
tors of M if we have M = >7,., Az;. If M admits a finite system of generators, 
we say that M is finitely generated, or simply that M is a finite A-module.® 
Furthermore, the family (2;);er is called free or linearly independent if from an 
equation Ye 7, a2; = 0 with coefficients a; € A we can conclude that a; = 0 
for alli € J. A free system of generators will also be referred to as a basis; 
in this case every element x € M admits a representation # = }7,-,a;v; with 
coefficients a; € A that are unique, and we say that M is a free A-module. For 
example, A” for n € N is a free A-module, just as A“) is for an arbitrary index 
set I. 

If we consider a field K instead of a general ring A as coefficient domain, the 
theory of A-modules specializes to the theory of K-vector spaces. Furthermore, 
let us point out that computations in a module M over a ring A follow to a 
large extent the rules we are used to in vector spaces over fields. However, there 
is one major exception that has to be observed: from an equation ax = 0 for 
elements a € A, x € M we cannot necessarily conclude that a or 2 vanishes, 
since even for a 4 0 there might not exist an inverse a~! in A. As a consequence, 
A-modules, even finitely generated ones, do not necessarily admit a basis. For 
example, if a C A is a nontrivial ideal, then the residue class ring A/a is an 
example of such an A-module that is not free. 

From now on let A be an integral domain. An clement x of an A-module M is 
called a torsion element if there exists an element a € A—{0} such that ax = 0. 
Due to the fact that A is an integral domain, the torsion elements constitute a 
submodule T C M, the so-called torsion submodule of M. If T = 0, we call M 
torsion-free; and if T = M, a torsion module. For example, every free module is 
torsion-free, and every finite abelian group, viewed as a Z-module, is a torsion 
module. Further, we define the rank of an A-module M, denoted by rank M, 


3 Observe the usage of language: in contrast to the notions of finite group, finite ring, and 
finite field, we do not require that a finite A-module consist of only finitely many elements. 
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as the supremum of all numbers n such that there exists a linearly independent 
system of elements 71,...,2, in M. In this way, the rank of a module is defined 
similarly to the dimension of a vector space. Note that M is a torsion module 
if and only if its rank is zero. 

Now let S = A—{0} and consider the field of fractions K = S~'A of the 
integral domain A. For any given A-module M, we can construct the associated 
K-vector space S~'M by proceeding as in the case of rings of fractions in 
Section 2.7. Indeed, let S~'M be the set of all fractions of type = for « € M 
and s € S, where © is identified with x if there exists an element s” € S such 
that s’"(s'’x — sa’) = 0. Then S~!M becomes a K-vector space under the usual 
rules of fractional arithmetic, and it is verified without difficulty that the rank 
of M coincides with the dimension of S~!M as a K-vector space. Furthermore, 
the kernel of the canonical map M —> S-!M, 2 +> 7, equals the torsion 
submodule T Cc M. 

From now on we will always assume that A is a principal ideal domain. For 
technical reasons we need the notion of length of an A-module M, in particular 
in the case that M is a torsion module, which is defined as the supremum /4(/) 
of all numbers ¢ such that there is a chain of submodules 


0SGMEOMC...¢ M@=M 


of length @. For example, the zero module is of length 0, and the free Z-module 
Z is of length oo. If V is a vector space over a field AK, then the length Ix (V) 
coincides with the vector space dimension dim, V. 


Lemma 1. (i) Let A be a principal ideal domain and a € A an element with 
prime factorization a = p,...pr. Then l4(A/aA) =r. 

(ii) Let M be an A-module that is the direct sum of two submodules M’ and 
M". Then l4(M) = 14(M') +14(M"). 


Proof. We start with assertion (ii). If there are chains of submodules 


Mi 
mM? 


M. 
M. 


Mi = M’, 
M" = M" 


N~ 


0 
0 


= 


C ma 
= z= 
amen ¢ 
= = 


iN An 
iN in 


bo 


then 


0CM,S60CMZE0C...¢ M80 
CM@MiCMOMYC...¢ MIO@Ml=M 


= 


is a chain of length r+s in M. Consequently, we have l4(M) > l4(M')4+l4(M"). 
To verify the opposite estimate, consider a chain of submodules 


0=MEMCMS...c M=M 


and let 1”: M'’ 6 M” —+ M" be the projection onto the second summand, so 
that ker 7” = M’. Then we get MyM’ © My419M' or m”(My) © 7"(M)y41) for 
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0<.2< ¢, and we can conclude that € < 14(M"‘) +14(M"). Hence, assertion (ii) 
is clear. 

Now assertion (i) is easy to justify. Renumbering the p;, we can look at a 
prime factorization of type a = epy'...p%* for a unit ¢ and pairwise nonassoci- 
ated prime elements p1,..., ps, where r = ,+...+v,. Then, due to the Chinese 
remainder theorem in the version of 2.4/14, we conclude that A/aA, as a ring, 
is isomorphic to the ring-theoretic product [];_, A/p/*A. Moreover, thinking in 
terms of A-modules, the decomposition is interpreted from the additive point 
of view as the direct sum 


A/aA ~ A/pi A®...B A/pe aA. 


Therefore, using assertion (ii), which has already been proved, it is enough 
to consider the case s = l, i.e., the case a = p” for a single prime element 
p € A. The submodules of A/p”A correspond bijectively to the ideals a C A 
satisfying p” € a, and since A is a principal ideal domain, bijectively to the 
divisors p°,p',...,p” of p’. Since p’*'A is strictly contained in p'A for all i, we 
get l4(A/p”) = v, which had to be shown. 


Next we turn to the proof of the elementary divisor theorem, which, as 
mentioned already, is a key result for the study of finitely generated modules 
over principal ideal domains and of finitely generated abelian groups. 


Theorem 2. Consider a finite free module F' over a principal ideal domain A 
and a submodule M C F of rank n. Then there exist elements 11,...,%, € F 
that are part of a basis of F, as well as coefficients ay,...,Qn, € A—{0} such 
that: 


(i) Q1%1,..-,An2n form a basis of M. 
(ii) a;|aiz, for 1<i<n. 
The elements ay,...,Q, are uniquely determined by M, up to associated- 


ness, and are independent of the choice of the elements x1,...,%n. They are 
called the elementary divisors of M Cc F. 


Remark 3. In the situation of Theorem 2, the submodule @Q;j_, Ati C F is 
uniquely determined by M as the saturation M,., of M in F’, which consists 
of all elements y € F such that there exists an element a 4 0 in A satisfying 
ay © M. Furthermore, we have 


Moor /M ~ CB A/a:A. 


i=l 


First, let us deduce Remark 3 from the existence assertion of Theorem 2. 
Clearly, we have a, - (Gii_, Avi) C M and therefore Qj, Ati C Meat. Con- 
versely, consider an element y € Mzat, where ay € M for some a € A— {0}. Due 


72 2. Rings and Polynomials 


to the assertion of Theorem 2, we can enlarge the system 21,...,%,, to a basis 
of F by adding elements x,41,...,2”,. Now represent y as a linear combination 
of this basis, say y = )7_, aj;xj. Since ay € M, we conclude that aa; = 0, 
and in particular a; = 0 for j7 =n+1,...,r. Therefore, y € Qj_, Av;, and we 
get Meat C @i_, Axi, and thus in fact Be, Ax; = Meat. To justify the second 
assertion of Remark 3, consider the A-isomorphisms A “> Ax;, a +> az;, for 
indices i = 1,...,n and observe that the ideal a;A C A is mapped bijectively 
onto the submodule Aa;x; C Ax;. Hence, Ax;/Aa;x; is isomorphic to A/a;A, 
and a direct sum analogue of this argument yields an isomorphism between 


(Di_, Avi)/M and @i_, A/aiA. 


For the proof of Theorem 2 we need the notion of content for elements x € F, 
denoted by cont(x). To define it, consider a basis y,...,y, of F' and represent 
x as a linear combination of the y; with coefficients in A, say « = jet CYj- 
Then we set cont(a) = ged(c,...,c¢,). In this way, cont(x) does not specify a 
particular element of A, but rather a class of associated elements. Note that 
cont(0) = 0, even if F = 0. To show that cont(a) is independent of the choice 
of the basis y,,..., y, of F', consider the A-module F* of all A-homomorphisms 
F —+ A, ie., of all linear functionals on F’. It is easy to see that the ele- 
ments of type v(x) for y € F* constitute an ideal in A, in fact a principal 
ideal (c), and we claim that c = cont(x). To justify this, choose an equation 
cont(x) = }0)_, ajc; with coefficients a; € A; cf. 2.4/13. Then, if y1,...,¢, is 
the dual basis associated to y1,..., yr, characterized by y;(y;) = 0 for i # 7 
and yi(yi:) = 1, we get p(x) = cont(x) for yp = )%_, aj~;. However, since 
cont(x) = gcd(c1,...,¢,) is always a divisor of 7(x) for W € F*, we must have 
c = cont(z). 

Let us list some properties of the notion of content that are used in the 
sequel. 


Lemma 4. In the situation of Theorem 2 the following assertions hold: 
(i) Given x € F there exists p € F* such that p(x) = cont(z). 
(ii) For x € F and  € F* we have cont(zx)|1(z). 
(iii) There exists an element x € M such that cont(x)|cont(y) for all y € M. 


Proof. Due to the considerations above, only assertion (iii) needs to be justified. 
To achieve this, look at the set of ideals of type cont(y)- A, where y varies 
over M/. There is a maximal element among all these ideals, i-e., one that is not 
strictly contained in any of the ideals cont(y)- A, y € MW. Indeed, otherwise we 
could construct an infinite sequence of elements y; in M such that 


cont(y1)-A € cont(y2)- AC... 


is a strictly ascending chain of ideals, contradicting the fact that A is Noetherian; 
cf. 2.4/8. Therefore, we can find an element « € M such that cont(x) - A is 
maximal in the sense just discussed. Furthermore, apply (i) and let » € F* 
satisfy p(x) = cont(x). We want to show that 
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(x) p(x) | y(y) for all y € M. 


To achieve this, consider an element y € M and let d= gced(y(x), y(y)). There 
are elements a,b € A such that ay(x) + bp(y) = d, and hence y(ax + by) = d. 
Furthermore, we get cont(ax + by)|d from (ii), and even cont(az + by) |cont(z), 
since d| (a). However, this implies cont(ax + by) = cont(x), due to the maxi- 
mality property of x. In particular, cont(x) is a divisor of d, and since d| p(y), 
even a divisor of p(y). This verifies (*). 

To prove cont(a) | cont(y), it is enough by (i) to prove y(a) | #(y) for all 
w € F*. Since v(x) | v(x) by (ii), as well as v(x) | p(y) by (*), we may replace y 

ely) 


by y— 3Gy# and thereby assume y(y) = 0. Furthermore, using these divisibility 
relations again, we can replace w by w — wep and assume w(a) = 0. Now let 


d = gcd(y(x), W(y)), say d = ay(x) + bi)(y) for a,b € A. Then, since y(y) = 0 
and w(a) = 0, we get 


(ep + Y)(ax + by) = a(x) + bY(y) = d, 


and thus cont(ax + by) | d by (ii). However, d divides y(a) by its definition. 
Therefore, we have cont(ax + by) | p(x) and even cont(ax + by) = y(a) by the 
maximality property of x. But then y(x)|d, and we conclude that p(x) | w(y) 
as desired, since d|~(y). 


Now we can do the proof of Theorem 2. In a first step we show that every 
submodule M Cc F is free. This fact will then be used in a second step to 
derive the existence part of the theorem. In both cases we use induction on 
n = rank M. So let us start by showing that M is free. If n = 0, we have M = 0, 
since M is torsion-free, and the assertion is clear. Now, assuming n > 0, choose 
an element « € M according to Lemma 4 (iii) that satisfies cont(a) | cont(y) for 
all y € M. Then there is a linear functional y € F* such that y(x) = cont(z), 
cf. Lemma 4 (i), as well as a (unique) element 2, € F’ such that x = y(2)2}. 
Setting Ff’ = kery and M’ = MN F’, we claim that 


(«) F= Az, 6 F’, M=ArQM". 
To justify the decomposition for M, consider an element y € M and write 


(y) (y) . 
= Feet ( - ca) 


Then the left-hand summand belongs to Ax. Indeed, we have y() | p(y), since 
cont(x) | cont(y) by the choice of 2, and since cont(y) | p(y) by Lemma 4 (ii). 
Moreover, the right-hand summand is contained in M’, since it belongs to M 
as well as to kery. In particular, the above decomposition of y shows that 
M = Ax + M'. Next, observe that we have v(x) 4 0, since M 4 0 and thus 
that Ax M’ = 0. It follows that MW is the direct sum of its submodules Ax 
and M’. In the same way one proves that F is the direct sum of the submodules 
Az, and F”; just replace x by x, in the preceding argument and use y(x1) = 1. 
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From the decomposition M = Ax@ M' we conclude that rank M’ < n, since 
x #0. Then M’ is free by the induction hypothesis, necessarily of rank n — 1, 
and we see that M is free as well. This settles our first induction argument. 

For the second induction we proceed in the same way, until we get to the 
decompositions (*). From the first induction we know that F” is a free sub- 
module of F. Thus, by the induction hypothesis, the existence part of The- 
orem 2 is available for the submodule M’ Cc F”. In particular, there exist 
elements %2,...,U, € F’ that are part of a basis of F’, as well as elements 
Q2,-.-,Qn € A—{O0} satisfying a; | aj, for 2 <i < n and with the property 
that ao%2,...,Q,2, form a basis of MW’. It follows that 2,,...,2, are part of a 
basis of F = Ax, ® F’, and that a121,...,Qn%n for ay := y(x) form a basis of 
M = Av @ M’. Thus, to derive the existence part of Theorem 2 it remains only 
to show that a,|a2. To justify the latter divisibility consider a linear functional 
(2 € F* satisfying yo(r2) = 1. Then we get v(x) | y2(agr2) and hence a; | a, 
since cont(a) | cont(a@2x2) by the choice of # and since cont(a222) | y2(a2X2) by 
Lemma 4 (ii). Thereby the existence part of Theorem 2 is clear. 

It remains to prove the uniqueness of the a;. In view of further applications, 
we do this in a slightly more general setting. 


Lemma 5. Let A be a principal ideal domain and let Q ~ @Qj_, A/aiA be 


an A-module, where a1,...,Q% € A—{0} are nonunits such that a; | ays for 
1<i<n. Then the elements a,,...,Q, are uniquely determined by Q, up to 
associatedness. 


Proof. For technical reasons we invert the numbering of the elements a; and 
consider two decompositions 


Q~ @BA/aiA ~ @B A/s;A 
i=1 j=l 


such that a;+1|a; for 1 <i <n, as well as 6;41| 6; for 1 < 7 < m. If there exists 
an index k < min{m,n} satisfying a,A 4 6,A, we choose k minimal with this 
property. Since a;A = G;A for 1 <7 < k and since all elements a;41,...,@n are 
divisors of az, we can decompose a;Q as follows: 


k-1 k-1 
anQ ~ Bay: (A/a:A) ~ Bay: (A/aiA) @ an - (A/H.A) O-.. 
i=1 i=1 


Now use Lemma 1. Comparing both decompositions and using the fact that 
L4(Q) < ov, we see that 14(a;,-(A/6,A)) = 0. This means that a;,-(A/6;,A) =0 
and hence azA C 6;,A. Likewise, we get 6,4 C a,A and thus a,A = 6A, 
contradicting our assumption on k. Therefore, we must have a;A = 6;A for all 
indices i satisfying 1 < i < min{m,n}. Furthermore, if m < n, we conclude 
from Lemma 1 again that @j_,,,,, A/aiA is of length 0 and hence vanishes. 
Consequently, m =n and a; is associated to 6; for i =1,...,n. 
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It remains to explain how to derive the uniqueness assertion of Theorem 2 
from Lemma 5. To do this, assume in the situation of the theorem that we have 
elementary divisors a1,...,Q, satisfying a;|aj41, as well as 6),..., 6, satisfying 
B;\ Biz for 1 <i <n. Then, according to Remark 3 (whose proof was based on 
the existence assertion of Theorem 2 and did not require uniqueness), we get 


an isomorphism 
QD A/aiA ~ DB A/BiA. 
i=1 i=l 


Since A/aA vanishes for units a € A, we can conclude from Lemma 5 that the 
nonunits among aj, Q2,... coincide with the nonunits among (3), 32,..., up to 
associatedness. Since the remaining a; and 6; are units, we get a;A = §;A for 
1<i<n, thereby ending the proof of Theorem 2. 


Next we want to give a more constructive characterization of elementary 
divisors, which will be of special interest for explicit computations. 


Proposition 6. Let A be a principal ideal domain, F a finite free A-module 


with basis 14,...,%,, as well as M C F a submodule of rank n with corre- 
sponding elementary divisors a1,...,Qn. Furthermore, let 2,...,;2%m € M be 
a (not necessarily free) system of generators of M. For j = 1,...,m assume 


2; = Vi, ax; for coefficients a;; € A, and let , fort =1,...,n be the great- 
est common divisor of all t-minors of the coefficient matrit D = (a;;).4. Then 
[iy = Ay... 4, and in particular, ay = py, as well as ayfy—1 = fy for t = 2,...,n. 

In the present situation, the elements ay,,...,Q» are referred to as the ele- 


mentary divisors of the matrix D. 


Proof. To start with, let us verify the assertion for t = 1. Note that (a,) C A 
is the ideal generated by all elements of type y(z) for z € M and y € F*; this 
can be read from the assertion of Theorem 2 or from its proof. In particular, 
evaluate the linear functionals of the dual basis attached to 21,...,2, at the 
elements z;. Thereby it is seen that the ideal (a) can also be generated by the 
coefficients a;;. However, this means that a, is the greatest common divisor of 
all 1-minors of D. 

To prove the assertion for arbitrary t it is convenient to use the t-fold exterior 
power i F of F. For our purposes it is enough to fix the basis 7,...,2, of 
F and to define ms F as the free A-module with basis given by the symbols 
Li, \...A\ 4, Where 1 <4 <<... <% <r. Then, for a permutation 7 € Gi, 
i.e., a bijective self-map of {1,...,t}, we let 


Ting N+.» A Zina = (sen7)- aj, A...A Xi, 
where sgn 7 is the sign of the permutation 7; cf. 5.3. To extend the notion of the 
t-fold “exterior product” 2;,\...A2;, to arbitrary indices 7,...,7, € {1,...,r}, 


“The t-minors of D are the determinants of the (t x t) submatrices of D. Since D, viewed 
as an (r <x m) matrix with coefficients in the field of fractions Q(A), is of rank n, we have 
n < min(r,m). 
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we set 7, A... A 2, = 0 for indices 7; that are not distinct. Then, finally, we 
can define the exterior product z;\...A z% of arbitrary elements z1,...,% € F 
by A-multilinear extension. Due to its construction, this product is multilinear 
and alternating in its factors. For example, for elements of type z; = )7}_, aij; 
we get 


an...ra=(> aati) A... A(> aint) 
i=1 w=1 
iB 
= y Gil ++ Git Ti AN... AN Li, 
115. 14=1 


ye (~~ (sgn 7) - Bicagt ses Aiget) Li N.. A Big; 


1<t1<...<iis<r = 7EGt 


where the coefficients ines, (S80 TT): Gi,qy1+++Giggyt equal the t-minors of the 
coefficient matrix when z1,..., 2 are represented as linear combinations of the 
basis x1,...,2,. It should be noted that this computation can also be used to 
show that the above definition of N F,, together with the t-fold exterior product 
of elements in F’, is naturally independent of the choice of the basis 71,..., >. 

Now we turn back to the system of generators 21,...,Zm of M that is given 
in the setting of the proposition, assuming for the moment that the z; form 
a basis of M or, more specifically, that we have z; = a;x; for 1 = 1,...,m 
and elements a; € A—{0}, where a; | aj+1. Note that due to the elementary 
divisor theorem, such a setting can always be obtained for m = n if we choose 
Y1,...,, as well as 21,..., Z In an appropriate way. Then we see that the t-fold 
exterior power /\' M is naturally a submodule of /\‘ F. Indeed, the elements 
vi, \...AN%;, for 1<t <...< i <r are a basis of NF, while the elements 
Qj, 4, Li, A... AN &;, for 1 < ty <1... < % < mare a basis of AM. Then 
we discover the product a,...a; as the first elementary divisor of the problem 
A‘ M c A‘ F, in accordance with the initial discussion for t = 1. 

In the general setting of the proposition, the elements 21,..., Zm constitute 
a system of generators of M that is not necessarily free. Nevertheless, it follows 
that the t-fold exterior products of type z;, \...A 2%, for 1<itp<...<4<m 
generate the A-module A\‘ M; use a computation as given above. Due to the 
consideration for t = 1 at the beginning of the proof, the first elementary divisor 
of the problem A‘ M c (\' F coincides with the greatest common divisor of all 
coefficients from A that are needed to represent the elements z;, A .../ 2, as 
linear combinations of the basis 7, A...A vi, 1 <a <...<% <r. However, 
as seen before, these coefficients are given by the t-minors of the matrix D, and 
it follows that the first elementary divisor attached to (\’ M C (\ F is given by 
fz. On the other hand, this elementary divisor has already been recognized as 
Q,...Q4, 80 that y= a,... a4. 


In the following we want to describe a constructive method that allows one 
to determine elementary divisors in the setting of Proposition 6 for the matrix 
D = (a;;) or the submodule M C F’. However, as a basic prerequisite, we have 
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to assume that A is a Huclidean domain. We consider A™ a free A-module with 


canonical basis e;,...,@,, and look at the A-homomorphism 
A™ —+ F, €j > 2, 
given by the matrix D relative to the bases e1,...,€m of A™ and x1,...,2, 


of F’. Below we will show that we can transform D using elementary row and 
column transformations—i.e., by interchanging rows (resp. columns), as well as 
addition of a scalar multiple of a row (resp. column) to a further row (resp. 
column)—into a matrix of type 


a, 0 0 0 0 
0 ay 0 0 0 
0 0 ... a, 0 0], 
0 0 0 0 0 
Ot a. 0 Oe. B 


where a; | aj41 for 1 < i < n. These transformations can be interpreted as 
multiplication of D from the left or the right by invertible matrices S € A*”), 
resp. T € A(™*™, Then the resulting matrix SDT still describes the map f, of 
course relative to new bases e/,...,¢/,, of A” and 2,..., 2). of F. In any case, 
M is generated by ay2},...,Qnx/, and the coefficients ay,..., Qn are recognized 
as the elementary divisors of D, resp. MC F. 

To find out about the necessary row and column transformations to be 
applied to the matrix D, we use the Euclidean function 6: A—{0} —> N of 
the Euclidean ring A. For D = 0 nothing has to be shown. Therefore, assume 
D # 0. It is our strategy to apply a series of elementary row and column 
transformations to D such that the minimum 


d= min{6(a) ; a is a coefficient # 0 of D} 


decreases step by step. Since 6 takes values in N, it is clear that the process must 
end after finitely many steps, and hence that d becomes minimal then. If a 4 0 
is a coefficient of the corresponding transformed matrix satisfying d(a) = d, we 
show with the help of Euclidean division that a divides all other coefficients of 
the matrix and hence that a is the first elementary divisor of D. 

In more detail, we proceed as follows. By interchanging rows, resp. columns, 
of D we may assume d = 6(aj,), ie., that d(ai1) is minimal among all 6(a;;) 
such that a;; # 0. If one of the elements of the first column, say aj, is not 
divisible by a,,, we apply Euclidean division by a4, to aj, say aj) = gay, + b, 
where 4(b) < d(a1,) with b ¥ 0, and subtract the q-fold first row from the ith 
row. As a result, the element b occurs at position (7,1), and we see that the 
minimum d of all é-values of nonzero coefficients of D has decreased. To further 
reduce d we start the transformation process anew, until all elements of the first 
column are divisible by the coefficient at position (1, 1). 
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In the same way we proceed with the elements of the first row. Since d 
assumes values in N and thus can decrease only finitely many times, we may 
assume after finitely many steps that every element of the first column as well 
as of the first row is a multiple of a,,. Adding suitable multiples of the first 
row to the remaining rows, we may assume aj, = 0 for 7 > 1. In the same way 
we proceed with the first row, thereby achieving aj; = a1; = 0 for i,j > 1. In 
addition, we may assume that the minimum d coincides with 6(a,,); otherwise 
we start the transformation process again from the beginning on. 

If there should exist indices i, 7 > 1 such that ai, {a;;, we apply Euclidean 
division by ay; to aj, say ai; = gai1 +b, where b 4 0 and 6(b) < d(ai1). Then we 
add the first row to the ith row and thereafter subtract the q-fold first column 
from the jth column. In this way, alongside other changes, a;; is replaced by 8, 
where now 6(b) < 6(ai1), and hence the minimum d of all 6-values of coefficients 
has decreased again. To continue, we start the transformation process anew from 
the beginning, until after finitely many steps we arrive at a matrix (a;;), where 
now a1 = a1; = O for i,j > 1, as well as ay | aj; for all i,7 > 1. Then we 
consider the submatrix (aj;;);,;51 of D = (a;;). Unless it is already zero, we can 
transform it in the same way that we did with D. Using an inductive argument, 
we arrive after finitely many steps at a matrix of the desired type in which the 
elementary divisors appear on the main diagonal and all other coefficients are 
Zero. 


Next we want to derive from the elementary divisor theorem the so-called 
fundamental theorem of finitely generated modules over principal ideal domains, 
whose assertion we split up into Corollaries 7 and 8. Note that in the following, 
A is a principal ideal domain again. 


Corollary 7. Let M be a finitely generated A-module and T C M its cor- 
responding torsion submodule. Then T is finitely generated, and there is a free 
submodule F C M such that M = T@F, where rank M = rank F. In particular, 
M is free if it does not admit torsion. 


Corollary 8. Let M be a finitely generated torsion module over A, and let 
PCA be a system of representatives of all prime elements of A. For p € P 
denote by 

M, = {x © M; p"x =0 for some n € N} 


the so-called submodule of p-torsion in M. Then M decomposes into the direct 


sum 
M=Q™,, 


pEeP 


where M, 1s trivial for almost all p € P. Furthermore, there exist natural num- 
bers 1 < v(p,1) <...< v(p,rp) for each p € P such that 
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where rT, = 0 for almost all p. The numbers rp, V(p, Jp) are uniquely determined 


by the isomorphism 
M ~ BOQ A/p A. 


pEP jp=1 


Combining both results, we see that every finitely generated A-module M 
is isomorphic to a direct sum of type 


where the numbers d, rp, and v(p, jp) as above are uniquely determined by M. 
This is the actual assertion of the fundamental theorem of finitely generated 
modules over principal ideal domains. Before we discuss its proof, let us formu- 
late this theorem especially for finitely generated Z-modules, where it is known 
as the fundamental theorem of finitely generated abelian groups. 


Corollary 9. Let G be a finitely generated abelian group, and let P be the set 
of prime numbers. Then G admits a decomposition into subgroups 


C= FODD Gry, 


PEP jp=1 


where F is finitely generated and free, say F ~ Z*, where Gpj, 1s cyclic of 
p-power order, say Gpj, Z [pI Z, for 1 < v(p,1) < ... < v(p,rp), and 
where rp is zero for almost all p € P. The numbers d,rp,v(p, jp) are uniquely 
determined by G, and the same is true for the subgroups G, = Beat Gp, 

If G is a finitely generated torsion group, in the sense of a finitely generated 
torsion Z-module, then G does not admit a free part and therefore consists of 
only finitely many elements, as we can conclude from Corollary 9. On the other 
hand, it is clear that every finite abelian group is a torsion group. 


Let us turn now to the proof of Corollary 7. If z1,...,2, generate M as 
an A-module, we can define an A-homomorphism f: A” —> M by mapping 
the canonical basis of A” to z,,...,2,. Then f is surjective, and we get an 
isomorphism M ~ A"/ ker f from the fundamental theorem on homomorphisms. 
Next we apply the elementary divisor theorem to the submodule ker f C A’. 
Hence, there exist elements x,,..., 2, forming a basis of A”, as well as elements 
Q1,-+-,Qn, € A, n = rank(ker f), such that a121,...,QpXp, form a basis of ker f. 
From this we get an isomorphism 


M~A™ © fan) A/aiA, 


i=1 
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where @;_, A/a;A corresponds to the torsion submodule T C M, and where 
A'~" corresponds to a free submodule F’ C M, thus implying M = T @ F. 
Furthermore, T ~ @j_, A/a;A is finitely generated, thereby settling the proof 
of Corollary 7. 


To approach the proof of Corollary 8, assume that M is a torsion module. 
Then using the setting of the proof of Corollary 7, we see that M is isomorphic 
to the direct sum €)_, A/a;A. Now decompose the a; into prime factors, say 
a = €iTnep p’) for units €; and exponents v(p, i) that are trivial for almost 
all p. Applying the Chinese remainder theorem 2.4/14, we see that 


A/aA~ BD A/pA, 


pEeP 


and hence that 


M~@ fay A/pPPOA. 


peP i=l 


Clearly, the term @j_, A/ ‘py’? A of the latter decomposition corresponds to the 
submodule M, C M of p-torsion, which is unique; observe that the residue class 
of p in residue class rings of type A/p'"A for p' € P — {p} is always a unit. In 
particular, the above decomposition leads to a decomposition M = @,,- p My. 
Now, looking at the decomposition 


pEP 


My & ran A/p’ A, 
i=1 


we abandon all terms A/p’®)A in which v(p,i) = 0, since these are trivial. 
Furthermore, renumbering the exponents v(p,7) for fixed p in ascending order, 
say 


M, ~ BD A/p”™) A, 


Jp=1 


where 1 < v(p,1) < ... < v(p,rp), the existence assertion of Corollary 8 is 
clear, whereas the uniqueness is a consequence of Lemma 5. 


The methods and results treated in the present section rely in a fundamen- 
tal way on the ideal-theoretic characterization 2.4/13 of the greatest common 
divisor, thus on a characterization that is valid in principal ideal domains, but 
not necessarily in unique factorization domains; cf. Section 2.4, Exercise 2. This 
is why it is not possible to extend the theory of elementary divisors to finitely 
generated modules over general unique factorization domains. 
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Exercises 


Let A always be a principal ideal domain. 


1. Consider a decomposition M =T © F of a finitely generated A-module M into a 
torsion submodule T and a free submodule F. Discuss the uniqueness of such a 
decomposition. Study the same problem for a decomposition of type M = M'@M", 
where M' ~ A/p"A and M" ~ A/p°A for a prime element p € A. 


2. A torsion-free A-module is free if it is finitely generated. Can we extend this result 
to arbitrary torsion-free A-modules? 


3. Derive the theory of canonical forms for endomorphisms of finite-dimensional 
vector spaces from Corollary 8. 


4. Determine the elementary divisors of the following matrix: 


e 73x3) 


own, 
HH bo CO 


6 
1 
5 


5. Let aii,...,@in € A be elements such that ged(ai1,...,@1n) = 1. Show that there 
are elements ajj € A, i = 2,...,n,j =1,...,n, such that the matrix (a;;)j j=1,4n 
is invertible in A(™*”), 


6. Consider an A-homomorphism f: L —+ M between finitely generated free 
A-modules. Show: 


(i) There exists a free submodule F C LE such that L = ker f @ F. 


(ii) There exist bases 71,...,%mp of LD and y1,...,Yn of M, as well as elements 
ay,...,a- € A—{O}, r < min{m,n}, such that f(x;) = ayy; for i =1,...,r 
and f(x;) =0 for i > r. In addition, we can obtain the divisibility relations 
a;|aj41 for 1 <i<r. 


7. Give a simple argument for extending the assertion of Theorem 2 to finite-rank 
submodules M of free A-modules F that are not necessarily of finite rank them- 
selves. 


3. Algebraic Field Extensions ®) 


Check for 
updates 


Background and Overview 


First let us indicate how algebraic equations are related to algebraic field ex- 
tensions. We start with the simple case of an algebraic equation with rational 
coefficients, say f(x) = 0, where f € Q[X] is a monic polynomial of degree > 1. 
The problem of where to look for solutions of such an equation and how to use 
them in computations will be postponed for the moment, since we will assume 
that the fundamental theorem of algebra is already known. Thus, we will use 
the fact that f admits a zero a in C, where now f(a) = 0 has to be interpreted 
as an equation valid in C. However, to better describe the “nature” of the zero 
a, one tries to construct a domain of numbers, as small as possible, where the 
equation f(a) = 0 still makes sense. For example, such a domain is given by 
the smallest subring of C containing Q and a, hence by 


Qla] = {g(a); g € Q[X]}. 


Using the epimorphism y: Q[X] —> Q[a], g > g(a), it is easily seen that 
Q[a] is even a field. Indeed, Q[X] is a principal ideal domain, and hence 
ker vy is a principal ideal, say ker y = (q). Of course, f € ker y shows that q is 
nonzero and therefore can be assumed to be monic in Q[X]. Now the fundamen- 
tal theorem on homomorphisms 2.3/5 implies that y induces an isomorphism 
QLX]/(q) > Q[a], and it follows from 2.3/8 that g is a prime polynomial, 
the so-called minimal polynomial of a. If f is irreducible, we can even conclude 
that gq = f by a divisibility argument. In any case, the ideal (q) is maximal in 
Q[X] by 2.4/6, so that Qfa] ~ Q[X]/(q) is indeed a field. We say that Q[a] 
arises from Q by adjoining the zero a. In the same way, one can adjoin further 
zeros of f (or of other polynomials with coefficients in Q[a]) to Q[a]. 

We can draw some important conclusions from these observations. First of 
all, we see that Q[a], as a Q-vector space, is of finite dimension and hence 
that Q C Q[a] is a finite field extension; cf. 3.2/6. Using a simple dimension 
argument from linear algebra, this implies that every element of Q[a] may be 
viewed as the solution of an algebraic equation with coefficients in Q, hence that 
QC Q[a] is an algebraic field extension, as we will say; cf. 3.2/7. In particular, 
when looking at the extension Q C Q[a], we are dealing with a whole class of 
related algebraic equations simultaneously. 
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In the following we assume that the polynomial f € Q[X] is irreducible; 
let ay,...,Q@, € C be its zeros. Then we obtain for every i = 1,...,n an 
isomorphism Q[a;] ~ Q[X]/(f), as constructed before, with a; corresponding 
to the residue class of X modulo (f). In particular, given two indices i, j, there 
is an isomorphism o;;: Q[a;] > Q[a;] such that o;;(a;) = aj, and we see 
that all zeros of f are of equal priority. The isomorphisms oj; open up a first 
view on the Galois theory of the equation f(x) = 0. In the special case that the 
subfield L = Q[a;] C C is independent of i, the o;; constitute (not necessarily 
distinct) automorphisms of L, these being the members of the Galois group of 
the equation f(z) = 0. In general, one considers instead of Q[a;] the so-called 
splitting field L = Q[ay,...,Q,] of f, which is constructed from Q by adjoining 
all zeros of f. One can show, using the primitive element theorem 3.6/12, that 
there is an irreducible polynomial g € Q[X] with zeros 6),...,6, € C such that 
L=Q[8;] for 7 =1,...,r. Then we are in the situation of the special case, as 
just considered, and we can define the Galois group of the equation f(x) = 0 
as the corresponding group of the equation g(x) = 0. 


Up to now we have restricted ourselves to field extensions of Q. But how can 
we proceed when we want to replace Q by an arbitrary field kK? In principle, no 
changes are necessary, as we will see in the present chapter. The only auxiliary 
tool we might need is a certain replacement of the fundamental theorem of alge- 
bra. To begin with, we study in 3.2 the relationship between finite and algebraic 
field extensions, without departing from special algebraic equations that are to 
be solved; a generalization of these considerations to the ring-theoretic level is 
presented in 3.3. Then, in 3.4, we approach the problem of constructing for an 
irreducible algebraic equation f(x) = 0, where f € K[X], an extension field 
L of K such that f admits a zero a in L. If L is such a field, we can consider 
the field K [a] as before; it is isomorphic to A[X]/(f), since f is irreducible. 
On the other hand, we can define L simply by K[X]/(f), observing that the 
residue class of X is a zero of f; this is Kronecker’s construction; cf. 3.4/1. The 
construction allows us to successively adjoin zeros of polynomials to kK’. For ex- 
ample, if we have adjoined a zero a; of f to K, then f admits a factorization of 
type f = (X —az) fi in K[a,][X]. In a next step, we can adjoin a zero Qe of fi 
to K[a,], and so on. In this way, we obtain after finitely many steps a splitting 
field L of f, i-e., an extension field of K’, over which f factorizes completely into 
linear factors, and which is minimal in the sense that it is obtained by adjoining 
all zeros of f to K. 


Although in principle, Kronecker’s construction is sufficient for the study 
of algebraic equations, it is nevertheless desirable for several reasons to obtain 
a “true” substitute for the fundamental theorem of algebra. Therefore, we con- 
struct in 3.4 an algebraic closure K of K. In fact, we use a method of E. Artin 
that allows us to adjoin to K all zeros of the polynomials in K [|X] in one step. 
The field K is algebraic over K and has the property that every nonconstant 
polynomial in K[X] factorizes completely into linear factors. This construc- 
tion makes it possible to talk about “the” zeros of f. Then, for example, the 
construction of splitting fields of a family of polynomials, as considered in 3.5, 
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no longer poses any problems. We thereby arrive at the notion of normal field 
extensions, a prestage of Galois extensions. 

Finally, let us mention the phenomenon of inseparability, which occurs when 
one is dealing with fields K of characteristic > 0 instead of extension fields of 
Q. The characteristic of K is defined as the smallest integer p > 0 such that 
p-1 = 0, where we put p = 0 if such a number does not exist; cf. 3.1. A 
polynomial f € K[X] is called separable if it admits only simple zeros (in 
an algebraic closure of A’), and purely inseparable if it admits precisely one 
zero, the latter being of multiplicity deg f. Irreducible polynomials over fields 
of characteristic 0 are always separable, while the same is not true over fields 
of characteristic > 0. Based on the corresponding notion for polynomials, we 
study separable algebraic field extensions in 3.6 and, as their counterpart, purely 
inseparable extensions in 3.7. Of special interest are the results 3.7/4 and 3.7/5, 
asserting that it is possible to split up algebraic field extensions into a separable 
and a purely inseparable part. As an example, we study in 3.8 special fields of 
characteristic > 0, namely finite fields. 

The chapter closes in 3.9 with a first look at the beginnings of algebraic 
geometry, i.e., the theory of algebraic equations in several variables. 


3.1 The Characteristic of a Field 


Given a ring K, there exists a unique ring homomorphism 
y:Z— K, 


namely, the one characterized by n +> n- 1. By the fundamental theorem on 
homomorphisms for rings 2.3/4, it gives rise to a monomorphism Z/ ker yp @ K, 
where ker y is a principal ideal in Z by 2.4/3. If K is an integral domain, for 
example a field, then Z/ ker y is an integral domain as well, and it follows that 
ker vy is a prime ideal. Hence, ker y is either the zero ideal or an ideal generated 
by a prime number p; cf. 2.3/11. Accordingly, 0 or p is called the characteristic 
of the integral domain or the field Kk. 


Definition 1. Let K be a field (or, more generally, an integral domain) and let 
yp: Z — K be the canonical ring homomorphism discussed before. Furthermore, 
let p € N be a generating element of the principal ideal ker y. Then p is called 
the characteristic of K, and we write p = char K. 


The fields Q, R, C are all of characteristic 0, whereas for a prime number 
p the field F, = Z/pZ consisting of p elements is of characteristic p. A subring 
T of a field K is called a subfield if T is a field itself. Of course, we have 
char K = char T in this case. Since the intersection of subfields of a field K is a 
subfield again, it follows that AK contains a unique smallest subfield P, namely 
the one given by the intersection of all subfields in K. The field P is referred to 
as the prime (sub) field in K. 
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Proposition 2. Let K be a field and P C K its prime subfield. Then: 

(i) char K = p>0<— > P/F, for p prime. 

(ii) char kK =0= >} P~Q. 

In particular, up to isomorphism, the only prime subfields that can occur, 
are F,, for a prime number p, as well as Q. 


Proof. We have char F,, = p, as well as char Q = 0. Since char P = char Kk, we 
get char kK = p from P ~ F,, and char kK = 0 from P ~ Q. This justifies in 
each of the cases (i) and (ii) the implication “<=”. 

To verify the reverse implications consider the canonical ring homomor- 
phism y: Z —> K; it factors through the prime subfield P C K, so that 
imy C P. If char K is a prime number p, we have ker y = (p), and the image 
imny ~ Z/(p) is a field by 2.3/6 or 2.4/6. Since P is the smallest subfield of K, 
we get imy = P, and hence P ~ F,. Otherwise, if char K = 0, we see that im y 
is isomorphic to Z. In particular, the field of fractions Q(im vy) is a subfield of 
P that is isomorphic to Q, and it follows that P = Q(imy) ~ Q. 


Working in characteristic p > 0, we want to point out that the binomial 
expansion for p-powers of a sum of two elements takes a particularly simple 
form. 


Remark 3. Let p be a prime number and R an integral domain of characteristic 
p (or, more generally, a ring satisfying p- 1=0). Then, for elements a,b € R 
and r EN, the binomial formula takes the following form: 


T 


(a+b)? =a? +B, (a— 0b)? =a? — BF. 


Proof. Using induction on r, the assertion is easily reduced to the case r = 1. 
Now recall from example (2) at the end of Section 2.8 that the following divis- 
ibility relations hold: 


p|(?) for v=1,...,p—1. 


In particular, the specified binomial coefficients will vanish in R. Hence, the 
first of the asserted formulas follows for r = 1. If we use for p even, i.e., for 
p = 2, that 1 = —1 holds in R, then also the second formula is clear. 


If K is a field of characteristic p > 0, then Remark 3 shows that the map 
o:k > K, ara, 


respects the addition on K. Therefore, it defines a homomorphism of fields, the 
Frobenius homomorphism of K. 


Exercises 


1. Can there exist homomorphisms between fields of different characteristic? Con- 
sider the same problem for integral domains. 
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2. Does there exist a field consisting of six elements? Does there exist an integral 
domain consisting of six elements? 


3. For a finite field K with multiplicative group K*, consider H = {a?; a € K*} 
as a subgroup of K* and show that 


K* if char K = 2, 
a subgroup in K* of index 2 ifchar K > 2. 


4. Let K be a field of characteristic > 0. Show that the Frobenius homomorphism 
o: kK —+ K is an automorphism if K is finite. Check whether this assertion 
extends to the case that K is not necessarily finite. 


5. Explicitly specify the Frobenius homomorphism on F,. 


3.2 Finite and Algebraic Field Extensions 


A pair of fields K C L, where K is a subfield of L, is called a field extension. 
More specifically, we will say that L is a field extending Kk or an extension field 
of K, or simply that L is a “field extension” of A. Given such a field exten- 
sion, we can restrict the multiplication on L to a multiplication K x L — L 
and thereby view L as a K-vector space. For field extensions K C L we will 
often use the notation L/K, at least if no confusion with factor group or factor 
ring constructions is possible. Dealing with field extensions L/K, we will also 
consider intermediate fields, i.c., subfields F such that kK C EC L. 


Definition 1. Let kK C L be a field extension. Then the vector space dimension 
[L : K] := dimg L is called the degree of L over K. The field extension is 
called finite or infinite, depending on whether [L: K] is finite or infinite. 


Note that ZL = K is obviously equivalent to [L: K] = 1. 


Proposition 2 (Multiplicativity formula). Let K C LC M be field extensions. 
Then 
[M:K]=[M:1L]-[L: kK]. 


Proof. If one of the degrees is infinite, the equation has to be interpreted in 
the obvious way. However, most interesting is the case in which both [M : L}] 
and [L : K] are finite. In the latter case choose vector space bases 21,...,Um 
of L over K, as well as yj,...,Yn of M over L. To verify the degree formula 
[M: kK] =[(M: L]-[L: K] = mn it is enough to check that the elements 
Liyj,t=1,...m,j =1,...,n, form a basis of M over K. In a first step we show 
that the linear independence of the x; over K in conjunction with the linear 
independence of the y; over L implies the linear independence of the x;y; over 
Kk. To do this, consider coefficients c;; € K satisfying Xe Cytiyj = 0. Then we 
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can write the left-hand sum as a linear combination of the y; with coefficients 


in L, namely 

302 cist) Uy = 0. 

j=l ‘i=l 
Since the elements y; are linearly independent over L, we get }°,¢,;7; = 0 for 
all 7. In the same way we conclude that ¢,; = 0 for all 7 and j, since the 2; 
are linearly independent over K. Hence, it follows that the x;y; are linearly 
independent over K. 

Just as easily we can see that the x;y; form a system of generators for 
over KY. Indeed, every element z € M admits a representation z = we) Cis 
with coefficients c; € L, since the y; form a system of generators for M over 
L. Further, for each j there is a representation c; = ou cv; With coefficients 
cy € K, since the x; form a system of generators for L over kK’. Thereby we get 


nom 
z= ) ) Cig ViYj 


j=l i=l 


and see that the x;y; form a system of generators, thus, altogether, a basis of 
M over K. 

It remains to look at the case that at least one of the extensions M/L 
and L/K is not finite. In the first step of the proof we have shown for ele- 
ments 21,..-,%, € L that are linearly independent over K, and for elements 
Y1y+++5Yn © M that are linearly independent over L, that the products x;y; are 
linearly independent over K. In other words, [L : kK] >m and [M: L] >n 
imply [M : K] > mn. Therefore, [M : K] is infinite as soon as one of the 
degrees [M : L] and [L: K] is infinite. 


Corollary 3. If K C LC M are field extensions such that p= [M : K] is 
prime, then L= K or L=M. 


Examples of finite field extensions of degree 2 are given by the extensions 
Rc Cand Qc Q[v2], where we view Q[V2] as a subring of R. On the other 
hand, the extensions Q C R as well as K C K(X) = Q(K[X]) for an arbitrary 
field K and a variable X are infinite. 


Definition 4. Given a field extension K C L, an element a € L is called 
algebraic over K if it satisfies an algebraic equation over K, i.e., an equation 


a’ +can t+ ...4¢, =0 


with coefficients c1,...,Cn € K. This means that the kernel of the substitution 
homomorphism 


py: K[X] +L, = g-+ g(a), 


is nontrivial. Otherwise, a 1s called transcendental over K. 
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Furthermore, the extension field L is called algebraic over K if every ele- 
ment a € L is algebraic over Kk. 


For example, the nth root ¥/q € R for some q € Q, ¢ > 0, and n € N—{O}, is 
algebraic over Q, since it is a zero of the polynomial X" — q € Q[X]. Similarly, 
the complex number e?*/” is an “nth root of unity” and therefore algebraic 
over Q. However, in general it is not easy to decide whether a complex number 
z is algebraic over Q, notably when z is constructed by means of methods from 
analysis; for example, see the transcendence problem for the numbers e and 7 
that was mentioned in the introduction. 


Remark 5. Let kK C L be a field extension and a € L an element that is 
algebraic over K. Then there exists a unique monic polynomial f € K[X] of 
smallest degree such that f(a) = 0. The kernel of the substitution homomor- 
phism 

gp: K[xX] - L, gr > g(a), 


satisfies ker p = (f), so that in particular, f is prime and therefore irreducible. 
The polynomial f is called the minimal polynomial of a over K. 


Proof. Recall that KX] is a principal ideal domain by 2.4/3. Therefore, the 
ideal ker y is generated by a polynomial f € K[X], where f #4 0, since a is 
algebraic over K. Furthermore, such a generator is unique up to a multiplicative 
constant from K*. Hence, if we require f to be monic, it becomes unique, and we 
see that f is the unique monic polynomial of smallest degree in AX ] such that 
f(a) = 0. Now observe that im y is a subring of L and hence an integral domain. 
Furthermore, im is isomorphic to A[X]/(f) by the fundamental theorem on 
homomorphisms 2.3/5. Therefore it follows that f is prime, and in particular, 
irreducible; cf. 2.3/8 and 2.4/6. 


Proposition 6. Let K C L be a field extension and a € L algebraic over K with 
minimal polynomial f € K(X]. Writing K[a] for the subring of L that is gen- 
erated by a and K, i.e., for the image of the homomorphism yp: K[X] — L, 
gt g(a), it follows that p induces an isomorphism K[X]/(f) ~> K[a]. 

In particular, K[a] is a field, and in fact, a finite field extension of K of 
degree [K[a] : K] = deg f. 


Proof. We have K[a] = imy ~ K[X]/(f), due to the fundamental theorem 
on homomorphisms. Since ker y = (f) is a nonzero prime ideal in K[X], we 
conclude from 2.4/6 that (f) is a maximal ideal in A [X]. Therefore, kK |X]/(f) 
and hence K [a] are fields. 

It remains to show that 


dimg K[X]/(f) = deg f. 


Assume that f = X" +¢X"!+...+c, and hence deg f = n. Furthermore, 
observe that Euclidean division by f is unique in A [X] in the sense that for 
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every g € K[X] there are unique polynomials g,r € KX] such that 
g=of +7, degr <n; 


cf. 2.1/4. Thus, writing X € K[X]/(f) for the residue class of X € K[X], it 
follows that every element of kK LX]/(f), viewed as a vector space over K’, admits 
a unique representation as a linear combination with coefficients in AK of the 
elements X°,...,X"-!. Then X°,..., X""! form a K-basis of K[X]/(f), and 
using the isomorphism K [a] ~ KX]/(f), we can see that a®,...,a"~! form a 
K-basis of K [a]. In particular, we get dimg K[X]/(f) =dimg K[a] =n. 


Let us consider a simple example. Choosing a prime number p and an integer 
n € N— {0}, the nth root 7/p € R is algebraic over Q, so that Q[ x/p] is a 
finite field extension of Q. The polynomial f = X”" — p € Q[X] is irreducible 
by Eisenstein’s criterion 2.8/1 and admits 7/p as a zero. Since it is monic, it 
must coincide with the minimal polynomial of 7/p. Consequently, we get 


[O[ ve] : Q| = deg f =n. 


As a by-product we conclude that the extension R/Q cannot be finite. Indeed, 
there are intermediate fields Q[ 7/p] of arbitrarily large degree. 


Proposition 7. Every finite field extension K C L is algebraic. 


Proof. Let [L : K] = n and consider an element a € L. Then the powers 
a,...,a” give rise to a system of length n +1 and hence to a system in L that 


is linearly dependent over K. It follows that there exists a nontrivial equation 
ca? +...+c,a% =0 


with coefficients c; € A’. Now let n’ be maximal among all indices i € {0,...,n} 
such that c; £ 0. Multiplying the equation by a suitable nonzero constant in 
Kk, we may assume that c,, = 1 and thereby get an algebraic equation for a 
over K. 


At the end of the present section we will give an example of an algebraic field 
extension that is not finite. Thereby we see that the converse of Proposition 7 
does not hold. 

If K Cc Lis a field extension and 2 = (a;)icr a system of elements in L 
(or a subset of L), we can consider the subfield A (2l) C L that is generated by 
4 over K. It is the smallest subfield of Z containing K as well as all elements 
a;, ie., A (20) is the intersection of all subfields of L that contain A’ and the 
a;. Given a field extension K C JL, there is always a system 2 of elements in L 
such that L = K (20); for example, take for 21 the system of all elements in L. 
We want to explicitly describe the subfield K(a1,...,a,) C L that is generated 
over K by finitely many elements a1,...,@, € L. Of course, it will contain the 
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ring K[ay,...,@,] of all polynomial expressions f(a1,...,Qn) for polynomials 
f € K[X,...,X,] and then also its field of fractions, so that 


K(a4,...,Qn) = Q(K[ai,...,@n]). 
Thereby we see that A’(a1,...,@,) consists of all quotients of type 


f(ai,-.-,Qn) 


g(Q1,-.-,Qn) 


? 


where f,g € K[X1,...,Xn], g(ai,---,Qn) # 0. For an arbitrary system 
MA = (a;)ier of elements in L, the field A(2l) can be described in a similar 
way, working with polynomials in K[%] for a system of variables ¥ = (Xi)iez. 
Alternatively, we can interpret A (2l) as the union of all subfields of L of type 
K(aj,,---,Q;,), where i;,...,i, € I. 


Definition 8. A field extension K C L is called simple if there exists an element 
a € L such that L = K(a). The degree [K(a): K] is referred to as the degree 
of a over K. 

A field extension K C L is called finitely generated if there exist finitely 
many elements ay,...,Q, € L such that L = K(ay,...,Qn). 


Proposition 9. Let L = K(ay,...,Qn) be a finitely generated field extension 
of K. Assume that a1,...,Q, are algebraic over K. Then: 

(i) L= K(ay,...,Q@n) = K[ay,..., an]. 

(ii) L is a finite, and in particular algebraic, field extension of K. 


Proof. We conclude by induction on n. The case n = 1 was already dealt with 
in Proposition 6. Therefore, let n > 1. We may assume by the induction hy- 
pothesis that K[a1,...,Qn-1] is a finite field extension of K. Furthermore, 
it follows from Proposition 6 that K[a,,...,Q,] is a finite field extension of 
K[aa,..-,Qn-1]. Then K[ay,...,Qn] is also finite over AK by Proposition 2, 
and in particular algebraic over K by Proposition 7. Since K[a1,...,Qn] is 
already a field, A(a4,...,@,) must coincide with K[a1,...,Q@n]. 


The above proposition includes the nontrivial assertion that a simple field 
extension L/K that is generated by an algebraic element is algebraic itself in 
the sense that every element of L is algebraic over K’. For example, using this 
fact we can easily see for n ¢ N—{0} that the real number cos ® is algebraic 
over Q. Indeed, cos = = $(e"/” + e-™/") is contained in Q(e™/”), where e™/” is 
algebraic over Q, since it is a 2nth root of unity. Since a finite field extension 
L/K is always finitely generated, for example by a K-basis of L, we obtain as 
a combination of Propositions 7 and 9 the following corollary: 


Corollary 10. For a field extension K C L, the following assertions are equiv- 
alent: 
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(i) L/K is finite. 
(ii) L/K is generated by finitely many elements that are algebraic over K. 
(iii) L/K is a finitely generated algebraic field extension. 


If 2 = (a;)ier is a system of generators of a field extension L/K, then L 
is the union of all its subfields of type K(aj,,...,a:,), where i1,...,i, € I. 
In particular, we can conclude from Corollary 10 that L/K is algebraic as 
soon as all a; are algebraic over K. Therefore, we can derive the following 
characterization (of not necessarily finitely generated) algebraic field extensions: 


Corollary 11. For a field extension K C L the following assertions are equiv- 
alent: 

(i) L/K is algebraic. 

(ii) L/K is generated by elements that are algebraic over K. 


Next, let us show that algebraic field extensions are transitive in a natural 
way. 


Proposition 12. Let K C LC M be field extensions. If a € M is algebraic 
over L and if L/K is algebraic, then a is algebraic over K as well. In particular, 
the field extension M/K is algebraic if and only if M/L and L/K are algebraic. 


Proof. Let f = X" +X" !+...+c, € LX] be the minimal polynomial of 
a over L. Then a is algebraic over the subfield K(c1,...,¢,) of L, and we can 
conclude from Proposition 6 that 


A Gipsy) Cis) < 00. 


Moreover, we have 
Leip ecg ae | <0 


due to Proposition 9, and hence 
Cre) : K] < 00 


due to Proposition 2. But then K(c1,...,¢n,@) and in particular a are algebraic 
over K by Proposition 7. 

The argument just given shows that M/K is algebraic if M/Z and L/K are 
algebraic. The converse of this is trivial. 


Finally, let us give an example of an algebraic field extension that is not 
finite and hence cannot be finitely generated. We consider 


L= fa € C; a is algebraic over Q} 


as a subfield of C extending Q. Indeed, LF is a field, since for a, 6 € L we get 
Q(a, 6) C L from Proposition 9. By its definition, L/Q is algebraic. Further- 
more, [L : Q] = ov, since L contains Q(7/p) for n € N—{0} and p prime as 
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a subfield and since, as we have seen, Q(x/p) is of degree n over Q. We write 


L = Qand call it the algebraic closure of Q in C. 
Exercises 
1. Let L/K be a field extension. Discuss the problem of showing for two elements 
a,b € L that are algebraic over K that their sum a+ b is algebraic over K as 
well. 
2. Characterize algebraic field extensions in terms of finite field extensions. 


3. Show that every element in C — Q is transcendental over Q. 


10. 


11. 


12. 


13. 


Let L/K be a finite field extension such that p = [L: K] is prime. Show: There 
exists an element a € DL such that L = K(a). 
Let L/K be a finite field extension of degree [L : K] = 2*. Let f € K[X] bea 
polynomial of degree 3 having a zero in DL. Show that f admits a zero already in 
K. 
Show that a field extension L/K is algebraic if and only if every subring R 
satisfying K C RC Lisa field. 
Let L/K be a finite field extension. Show: 
(i) For a € L, the minimal polynomial of a over K coincides with the minimal 
polynomial of the K-vector space homomorphism ya: L —> L, «+> ax. 
(ii) If L = K(a), the minimal polynomial of a over K coincides with the char- 
acteristic polynomial of (aq. 
(iii) For a € L, the characteristic polynomial of , is also referred to as the field 
polynomial of a, relative to the field extension L/K’. It is always a power of 
the minimal polynomial of a over K. 
Let a € C satisfy a? + 2a —1=0, so that it is algebraic over Q. Determine the 
minimal polynomial of a as well as that of a? + a, in each case over Q. 
Let K be a field and x an element of an extension field of A, and assume that x 
is transcendental over A’. Show for n € N—{0} that 2” is transcendental over Kv 
and that [K (x): K(a2")] =n. 
Let L/K be a field extension and let a € L be algebraic over K. Show for 
n € N—{0} that [K(a”) : K] > 4+[K(a): K]. 
Let K bea field and K(X) the function field in one variable X over kK. Consider 
a rational function g = f/g € K(X)—K, where f,g are coprime polynomials in 
EK LX]. Show that q is transcendental over K and that 
[K(X) : K(q)] = max(deg f, deg g). 


Determine the minimal polynomial of X over K(q). Hint: Use Exercise 3 in 
Section 2.7. 

Let L/K be a field extension. Show that two elements a, 3 € L are algebraic over 
K if and only if a+ and a: £ are algebraic over K. 

Consider two complex numbers a, 3 € C as well as exponents m,n € N such that 
gcd(m,n) = 1 and a™ = 2, 8" = 3. Show that Q(a, 6) = Q(a- 8) and determine 
the minimal polynomial of a- 6 over Q. 
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3.3 Integral Ring Extensions* 


In many respects, the theory of integral ring extensions to be dealt with in the 
present section can be viewed as a generalization of the theory of finite and 
algebraic field extensions, which was presented in Section 3.2. Even if one is 
primarily interested in the case of fields, the more general scope of ring theory 
leads to new insight, as we will see, for example, below in Corollary 8. We used 
vector spaces over fields as a technical tool in 3.2. In a similar way, we will rely 
on modules when dealing with ring extensions. See Section 2.9 for the definition 
of modules over rings. 


Given a ring extension R C R’, the inclusion map R @ R’ can always 
be viewed as a ring homomorphism. Instead of restricting ourselves to ring 
extensions, we will look in the following at the more general case of ring ho- 
momorphisms. For every ring homomorphism y: A —> B, we can view B ina 
natural way as an A-module; just multiply elements a € A by elements b € B 
by carrying out the product y(a)b in B. We say that y is finite if B is a finite 
A-module under y; we will also say in this case that B is finite over A or, if 
y is an inclusion homomorphism, that the extension A > B is finite. Further- 
more, y as well as B over A, resp. the ring extension A © B, are said to be 
of finite type if there exists an epimorphism ®: A[X1,...,Xn] —> B from a 
polynomial ring in finitely many variables over A to B that extends vy. Note that 
every finite ring homomorphism is, in particular, of finite type. That a ring ho- 
momorphism vy: A —> B is of finite type can also be characterized by the fact 
that there exist elements 21,...,0, € B satisfying B = y(A)[«41,...,2]. Here 
p(A)[21,...,;@n] C B indicates the subring of all expressions f(x1,...,%,) for 
polynomials f € y(A)[X1,...,Xn], as explained in 2.5. By abuse of notation, 
this ring will also be denoted by A[71,...,2n]. 

In the preceding situation it is convenient to employ the terminology of 
algebras. An algebra B over a ring A is just a ring homomorphism A —+ B. In 
particular, we can talk about (module-)finite A-algebras or about A-algebras of 
finite type. Also note that a homomorphism between two A-algebras B and C’ 
is meant as a ring homomorphism B —> C with the additional property that 
it is compatible with the defining homomorphisms A —> B and A —> C, in 
the sense that the diagram 


Sa 


A 


is commutative. 

By its definition, a field extension K C L is finite if and only if it is a 
finite ring extension. Note that a similar assertion for finitely generated field 
extensions and ring extensions of finite type does not hold in general. Of course, 
a field extension kK C L is finitely generated if it is a ring extension of finite 
type. However, the converse of this fails to be true, as we will see at the end of 
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the present section. As a next step, we want to extend the notion of algebraic 
field extensions to the context of rings. 


Lemma 1. For a ring homomorphism yp: A —> B and an element b € B, the 
following conditions are equivalent: 
(i) There exists an integral equation of b over A, i.e., an equation of type 
f(b) =0 for a monic polynomial f € ALX]. 
(ii) The subring A[b] C B, viewed as an A-module, is finitely generated. 
(iii) Viewing B as an A-module, there exists a finitely generated submodule 
M =U, Am; C B such that 1€ M and bM CM. 


Proof. We start with the implication (i) ==> (ii). So assume there is an equation 
f(b) =0 for a monic polynomial f €¢ ALX], say 


b+ ab" t+...4+a4, =0. 


Then 6” is an element of the A-module M = yo. Ab’, and we see by induction 
that 6’ € M for all i € N. Therefore, we get A[b] C M and hence A[b] = M. 
In particular, A[6] is a finitely generated A-module, and (ii) follows. 

The implication (ii) => (iii) is trivial. Thus, it remains to verify the implica- 
tion (iii) => (i). Let M = 5°", Am; C B be a finitely generated A-submodule 
of B such that 1 © M and bM C M. The latter inclusion shows that there is a 
set of equations 


bm, =aym+...+ Ain™Mn, 


bmn = Gnym +... + Ann Mn 
with coefficients a;; € A that in terms of matrices can be rewritten as 
my 
Mn 
given by 6,; = 1 for 7 = j, and 6;; = 0 for i 4 7. Now we apply Cramer’s rule, 
i.e., the relation 
(x) A*. A= (det A)-E 


involving the adjoint matrix A* € B"*" of A, the determinant of A, as well as 
the unit matrix F € B"*"; see, for example, [4], Satz 4.4/3. This equation is 
established in linear algebra for matrices with coefficients in a field, but we claim 
that it naturally extends to the more general setting of coefficients in rings, as 
is needed here. Indeed, comparing the coefficients of the matrices occurring in 
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(*) on the left-hand and the right-hand sides, the equation (*) consists of a 
system of polynomial identities between the coefficients of A. More generally, 
viewing the coefficients cj; of A as variables, these identities can be formulated 
over the polynomial ring Z[c;;]. They can then be derived from the classical 
case of fields by embedding Z[c;; | into its field of fractions Q(c;;). 

Now using Cramer’s rule («), we get 


my my 
(det A)- |; = A*. A. - | =0 
Mn Mn 
and hence (det A) -m,; = 0 for i = 1,...,n. Since the identity element 1 © B 


is a linear combination of the elements m; with coefficients in A, we conclude 
that det A = (det A) - 1 = 0. Therefore, 


det(d;;X ie aij) E A[X] 


is a monic polynomial admitting b as a zero, as desired. 


Definition 2. Let y: A — B be a ring homomorphism. An element b € B 
is called integral over A with respect to yp if b and ~ satisfy the equivalent 
conditions of Lemma 1. Moreover, we say that B is integral over A or that y 
is integral if every element b € B is integral over A in the way just described. 


It is obvious that the notions “integral” and “algebraic” coincide if we re- 
strict ourselves to field extensions. Furthermore, by establishing the equivalences 
of Lemma 1, we have already exhibited the crucial relations between integral 
and finite ring extensions. Below we formulate some special consequences of 
these that may be viewed as generalizations of the results 3.2/7, 3.2/9, and 
3.2/12. 


Corollary 3. Every finite ring homomorphism A —> B is integral. 


Proof. Use condition (iii) of Lemma 1 for M = B to see that A —> B is 
integral. 


Corollary 4. Let py: A —> B be a ring homomorphism of finite type and 
assume B= A[b,,...,b,] for elements b,...,b, € B that are integral over A. 
Then A —> B is finite and, in particular, integral. 


Proof. Consider the chain of “simple” ring extensions 
p(A) c y(A) [bi] C... C p(A)[bi,...,6-] = B. 


Each of these is finite by Lemma 1, and we easily conclude by induction that 
B is finite over A. Indeed, to carry out the induction step, just multiply the 
elements of a module generating system of B over y(A)[bi,...,0--1] by those 
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of a corresponding system of y(A)[bi,...,,-1] over A that is provided by the 
induction hypothesis. Thereby one obtains a module generating system of B 
over A, as is seen using a similar argumentation to that given in the proof of 
3.2/2. 


Corollary 5. Let A —> B and B —+ C be two finite (resp. integral) ring 
homomorphisms. Then their composition A —+ C is also finite (resp. integral). 


Proof. To settle the assertion for finite homomorphisms, we use the same argu- 
ment as the one applied in the induction step of the proof of Corollary 4. Hence, 
it remains to consider the case of integral homomorphisms. Therefore, assume 
that A —> B and B — C are integral and consider an element c € C. Then 
c satisfies an integral equation over B, say 


+ bc" t+... +b, =0, by,.--,bn € B, 


and we can conclude that c € C is integral over A[b,,...,b,]. In particular, the 
extension A[b1,...,6n] —> Al[bi,...,bn,c¢] is finite, as we see from Corollary 4. 
The same result also shows that A —> A[b),...,6n] is finite, so that altogether 
A —> Albi,...,bn,c] is finite. But then this homomorphism is integral as well, 
see Corollary 3, and it follows that c is integral over A. Finally, letting c vary 
over C, it follows that A —> C is integral. 


Next we want to prove a fundamental theorem that significantly clarifies 
the structure of algebras of finite type over fields. An analogue for field exten- 
sions, namely the decomposition of an arbitrary field extension into a purely 
transcendental and an algebraic one, will be dealt with in 7.1. 


Theorem 6 (Noether normalization). Let K be a field and K > B a nonzero 
K-algebra of finite type. Then there exist elements 11,...,2, € B such that B 
is finite over the subring K[a1,..., tr] C B, while the system x1,...,t, € B is 
algebraically independent over K (cf. 2.5/6). 


In other words, there exists a finite monomorphism K[Xq,...,X,] > B of 
K-algebras, where K[X,...,X,] is a polynomial ring in finitely many variables 
over K. 


Proof. Let B = K[bi,...,6n] for certain elements ),...,b, € B. If bi,...,0n 
are algebraically independent over AK’, nothing has to be shown. Assuming the 
contrary, there exists a nontrivial relation of type 


(*) YW ote So 


(...Un)EL 


with coefficients a,,..,, € K*, where the summation extends over a finite set 
I of n-tuples (14,...,U,) € N”. Now introduce new elements x1,...,%n_1 € B, 
say 


98 3. Algebraic Field Extensions 


a S1 zh — Sp—1 
21 =b,—d7,  ..., Ent = bp-1 — BFP, 


for certain exponents $1,...,5,-1 € N — {0} whose choice has still to be made 
precise. Then, in any case, we get 


B= K[bi,...,6n] = K[a1,..-,2n—1, bp]. 


Substituting b; = x;+b%' for i = 1,...,n—1 in the relation (*) and decomposing 
powers b/* = (x; + bf)” into the sum of b*” and terms of lower degree in by, 
we get a new relation of type 


(x) tip pe + f(x, ee bn) = 0), 


(1... )EL 


Here f(21,..-,;%n—1,0n) is a polynomial expression in b, with coefficients in 
K[21,-.-,;%n_1] such that the corresponding degree in b, is strictly less than 
the maximum of all sums 514 +...+ S,-1Yn—-1 + Vp for (4,...,Un) € I. As is 
easily checked, the integers s1,...,5,—; € N can be chosen in such a way that the 
exponents 5); +...+5,-1V¥,—1 + Vp for index tuples (14,...,%,) € I occurring 
in (**) are all different. Indeed, choose t € N bigger than the maximum of all 
indices ,...,Uy for (4,-.-,Up,) € I and let 


§, = tu, see Sy 1 = tl. 


Now view the relation (**) as a polynomial equation in b, with coefficients 
in K[a1,.--,%n-1]. Then it follows that there is a term of type ab% with a 
coefficient a € K* whose degree N strictly dominates the degrees of all other 
terms. In particular, multiplying it by a~!, the equation (**) can be read as 
an integral equation of b, over K[21,...,%n—1], and we see from Corollary 4 
that the extension K[21,...,%,-1] © B is finite. If x,,...,2, 1 happen to be 
algebraically independent over K’, we are done. Otherwise, we can apply the just 
described process anew to the ring K[21,..., 2,1]. Continuing in this way, we 
finally arrive at a system 21,...,2, that is algebraically independent over Kk. 
That the inclusion K[2,,...,2,] <> B is finite follows from Corollary 5. 


One can show that the integer r occurring in the theorem on Noether 
normalization is unique; it is the so-called dimension of the ring B. For an 
integral domain B, the uniqueness of r can easily be deduced from a corre- 
sponding uniqueness assertion on the transcendence degree of field extensions; 
cf. 7.1/5. Looking ahead to Section 7.1, we want to briefly explain this. In- 
deed, if x1,...,2, € B are algebraically independent over K and the extension 
K[2,...,%,] © B is finite, then the field of fractions Q(B) is algebraic over 
the purely transcendental extension K(a1,...,2,) of kK. Therefore, 21,..., 2, is 
a transcendence basis of Q(B)/K, cf. 7.1/2, and we have transdeg, Q(B) =r. 

As an application of Noether normalization, we want to justify the fact men- 
tioned already before that a finitely generated field extension is not necessarily 
a ring extension of finite type. To do this we need an auxiliary result. 
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Lemma 7. Let A > B be an integral extension of integral domains. If one of 
the rings A and B is a field, then the same is true for the other as well. 


Proof. Let A be a field and b ¥ 0 an element of B. Then 6 satisfies an integral 
equation over A, say 


B+ ab +... +a, = 0, Q,---,@, € A, 


and we may assume a, # 0. Indeed, pass to the field of fractions of B and 
multiply the equation by a suitable power of b~!. Now, if a, 4 0, its inverse a;! 
exists in A and we get 


bot = —an"(b"* 4+ ab”? +... + an_i) € B. 


Thereby we see that B is a field. 
Conversely, if B is a field, consider an element a € A, a 4 0. Then its inverse 
a~' exists in B and satisfies an integral equation over A, say 


a"™+aa"t!+...+a, =0, Q1,-..,4n € A. 


But then 


a = —a, — aga—...—a,a" 1 € A, 


and it follows that A is a field. 


Corollary 8. Let K C L be a field extension satisfying L = K[21,...,2n] for 
some elements £1,...,%, € L, i.e., assume that K C L, as a ring extension, is 
of finite type. Then the extension K C L is finite. 


Proof. Due to Theorem 6 on Noether normalization, there are elements y1,..., Yr 
in L such that the ring extension K[y1,...,y,] © L is finite and the elements 
Yi,--+,Yr are algebraically independent over K. Since L is a field, the same is 
true for K[y,...,y-] by Lemma 7. However, a polynomial ring over a field K 
in r variables cannot be a field if r > 0. Therefore, we have necessarily r = 0, 
and the extension kK < L is finite. 


A situation as considered in Corollary 8 can easily be set up by looking at 
polynomial rings modulo maximal ideals. 


Corollary 9. Let K[X,...,Xn] be the polynomial ring in n variables over a 
field K and let mc K[X,...,Xn] be a maximal ideal. Then the canonical map 
K —> K[X,...,X,]/m = L is finite, so that L/K is a finite field extension. 


Proof. Since m is a maximal ideal in K[X,..., Xn], we see that L is a field. 
Furthermore, if x; € L is the residue class of the variable X; for each i, we 
get L = K[a,...,%,]. Therefore, L/K is of finite type, and thus a finite field 
extension by Corollary 8. 
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Now consider the field of rational functions A(X) in one variable X over a 
field K. Then the field extension K(X)/K is finitely generated, namely by the 
variable X. However, as a ring extension it cannot be of finite type following 
Corollary 8, since the degree [K'(X) : K’] is infinite. Thereby we see that as 
mentioned before, the properties “finitely generated” and “of finite type” are 
not equivalent in the setting of field extensions. 


Exercises 


1. Let A C B be an integral ring extension. Discuss the problem of whether one 
can define for an element b € B its minimal polynomial over A. As an example, 
consider the extension A = {)\c,X' € K[X];c = 0} C K[X] = B, where 
K(X] is the polynomial ring in one variable over a field K. 


2. For aring homomorphism A —> B, let A denote the set of all elements in B that 
are integral over A. Show that A is a subring of B and that A —> B restricts to 
an integral homomorphism A —> A. The ring A is called the integral closure of 
Ain B. 


3. Let A be a unique factorization domain. Show that A is integrally closed in 
its field of fractions, i.e., that the integral closure of A in @(A) in the sense of 
Exercise 2 coincides with A. 


4. Let y: A @ A’ be an integral ring extension. Show for every maximal ideal 
m’ c A’ that the ideal y~!(m’) C A is maximal as well, and conversely for 
every maximal ideal m C A, that there exists a maximal ideal m’ C A’ satisfying 
yg t(m’) = m. Hint: For a maximal ideal m C A, consider the multiplicative 
system S = A—m, as well as the associated rings of fractions S~'A and S~!.A’ 
introduced in Section 2.7. In addition, one may use the fact that every nonzero 
ring admits a maximal ideal; cf. 3.4/6. 


3.4 Algebraic Closure 


The objective of the present section is to construct for a given field K a so- 
called algebraic closure, i.e., a minimal algebraic extension field K such that 
every nonconstant polynomial in KX] admits at least one zero in K. We start 
with Kronecker’s construction that was already mentioned before. It allows for 
a single nonconstant polynomial f € AK [LX] to set up a finite extension field 
L/K such that f acquires a zero in L. 


Proposition 1. Let K be a field and f € K[X] a polynomial of degree > 1. 
Then there exists a finite algebraic field extension K C L such that f admits a 
zero in L. If f is irreducible, we can set L := K[X]/(f). 


Proof. We may assume that f is irreducible; otherwise, decompose f into its 
prime factors and replace it by one of these. Then (f) is a maximal ideal in 


3.4 Algebraic Closure 101 


K[X] by 2.4/6, and it follows that L := KLX]/(f) is a field. Now consider the 
composition 


KO K[X] 4 K[X]/(f) =L, 


where 7 is the canonical epimorphism. Since the homomorphism kK —> L is 
injective as a homomorphism between fields, we can view L as a field extending 
kK. Then, writing x := 1(X) and f = )7y_) GX", we get 


f(a) = Sa = oe (X)i=n (~ ax") =7(f) =0, 
i=0 


i=0 


which shows that x is a zero of f. In particular, x is algebraic over K and 
satisfies L = K(x). Assuming f to be monic, it is the minimal polynomial of 
x over K, and we see from 3.2/6 that L/K is a finite field extension of degree 
n= deg f. 


Using Kronecker’s construction, we say that L is obtained from K via ad- 
junction of a zero x of f. In more precise terms, if f is irreducible, the zero x is 
defined as a residue class of the variable X and is forced to become a zero of f 
by passing from A LX] to its residue class ring L = K{X]/(f), which is a field. 
Then, over L, the linear factor X — x splits off from f, say f = (X —x)-g, and 
Kronecker’s construction can again be applied to g, unless the latter is already 
constant. Thus, after finitely many steps of this type, we arrive at an extension 
field K’ of K over which f admits a factorization into linear factors. In principle, 
such a process has to be applied simultaneously to all nonconstant polynomials 
in K LX] in order to construct an algebraic closure of K. 


Definition 2. A field K is called algebraically closed if every nonconstant 
polynomial f of K[X] admits a zero in K, or in other words, if f decomposes 
in K[X] into a product of linear factors. This means that f admits a product 
decomposition f = c[],(X—a;) with a constant c € K* as well as zeros a; € K. 


Remark 3. A field K is algebraically closed if and only if every algebraic field 
extension L/K is trivial. 


Proof. Assume that K is algebraically closed and that kK C L is an algebraic 
field extension. Furthermore, consider an element a € L together with its mini- 
mal polynomial f € A[X]. Then f decomposes over K into a product of linear 
factors and hence is linear, since it is irreducible. In particular, this shows that 
a € K and therefore L = Kk. Conversely, assume that K does not admit any 
nontrivial algebraic field extension and consider a polynomial f € K[X] of 
degree > 1. Using Kronecker’s construction, there exists an algebraic field ex- 
tension L/K such that f admits a zero in L. However, by our assumption we 
must have L = K, so that f has a zero in kK. Consequently, K is algebraically 
closed. 
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Theorem 4. Every field K admits an extension field L that is algebraically 
closed. 


For the proof of the theorem we need to know that every ring R 4 0 
contains a maximal ideal. The latter result is a consequence of Zorn’s lemma, 
whose assertion we will explain next. 

Let M be a set. A (partial) order on M is a relation! < such that the 
following conditions are satisfied: 


«<a forallae M (reflexivity) 
CY, YRS SS Bz (transitivity) 
osyysu = v=y (antisymmetry) 


The order is called total if every two elements x,y € M are comparable, i.e., if 
we have x <yory<2. 

For example, the standard less-than-or-equal relation < between real num- 
bers constitutes a total order on R. But we can also look at a set X and define 
M as its power set consisting of all subsets of X. Then the inclusion of subsets 
in X gives rise to a partial order on M. In general, it is not a total order, since 
for U,U' C X we do not necessarily have U Cc U' or U’ C U. In a similar way, 
we can consider for a ring R the set M of its proper ideals a € R, together with 
the inclusion as partial order. Then a is a maximal ideal in R if and only if ais a 
maximal element of 7. To be more specific on maximality, let us introduce for 
a set M with partial order < and an element a € M the following terminology: 

a is called the greatest element of M if « < a holds for all x € M; such an 
element a is unique if it exists. 

ais called a maximal element of M if a < x for some x € M always implies 
a=. 

a is called an upper bound for a subset N C M if a <a for alla EN. 

If there exists a greatest element in M, it is the unique maximal element in 
M. However, note that a partially ordered set M can contain several different 
maximal elements. If that is the case, there cannot exist a greatest element 
in M. 


Lemma 5 (Zorn). Let M be a partially ordered set such that every subset of 
M that is totally ordered with respect to the order induced from M admits an 
upper bound in M. Then there exists a maximal element in M.? 


For an elementary justification of the above result we refer to [12], Ap- 
pendix 2, §2. However, it should be pointed out that Zorn’s lemma is of ax- 
iomatic character. It is equivalent to the so-called axiom of choice, asserting 


T A relation on M is a subset RC M x M, where in the present case, we will write x < y 
if (x,y) ER. 

? Note that: the empty subset in M is totally ordered and therefore admits an upper bound 
in M if the assumptions of the lemma are met. In particular, we must have M 4 0) then. 
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that the Cartesian product of a nonempty family of nonempty sets is nonempty. 
As an application of Zorn’s lemma we prove the following result: 


Proposition 6. Let R be a ring and a ¢ R a proper ideal. Then R admits a 
maximal ideal m containing a. In particular, every ring R 4 0 admits a maximal 
ideal. 


Proof. Let M be the set of all proper ideals b € R containing a. Then / is 
partially ordered under the inclusion of ideals. Furthermore, since a € M, we 
see that M # §. We claim that every totally ordered subset N C M admits 
an upper bound in M. Indeed, let N be such a subset, where we may assume 
N #0. Then ¢ = U,en 6 is a proper ideal in R containing a, as is easily checked 
using the total order on N, and it follows that ¢ € M is an upper bound of N. 
Therefore, the assumptions of Zorn’s lemma are met, and M admits a maximal 
element, namely a maximal ideal m C R such that a C m. 


Proof of Theorem 4. We are now able to construct for a given field K an exten- 
sion field L that is algebraically closed. The construction process we will use is 
based on polynomial rings in infinitely many variables over K and goes back 
to E. Artin. In a first step we set up a field LZ, extending AK such that every 
polynomial f € K[X] of degree deg f > 1 admits a zero in L;. To do this we 
consider the system of variables X = (Xf) re, that is indexed by the set 


T={feK[X]; degf> 1}, 
and work with the polynomial ring A [*X]. More specifically, we look at the ideal 
a= (f(Xs); fe CK 


that is generated by the family of polynomials f(X,), where the variable X 
of f is replaced by Xy, for each f € J. We claim that a is a proper ideal in 
K[X]. Indeed, suppose that is not the case. Then we have 1 € a, and there is 
an equation 


YD afilXs) =1 


for suitable polynomials fi,..., fn € J and gi,..., gn € K[X]. Applying Kro- 
necker’s construction to the finitely many polynomials f;, there exists a field 
kK’ extending K such that each f; admits a zero a; in K’. Now, going back to 
the above equation, we may substitute Xy, by a; for i = 1,...,n, using the 
substitution homomorphism K[X] —+> K’ that substitutes Xy, by a; and the 
remaining variables by arbitrary values in K’, for example by 0. Hence, the 
left-hand side of the above equation vanishes, contradicting 1 on the right-hand 
side. Therefore a must be a proper ideal in K [¥], as claimed. 

Next, apply Proposition 6 and choose a maximal ideal m Cc K[¥] containing 
a. Then L; = K[X]/m is a field, which we will view as an extension of K via 
the composition of canonical maps 
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K > K[X] 3 K[X]/m= Ly. 


Similarly as for Kronecker’s construction, we conclude for f € J that the residue 
class X + of X» € K[X] in K[X]/m is a zero of f € K[X]. Note that again, 
the zeros of the polynomials f € J come into existence by passing to residue 
classes modulo a, resp. m. 

To end the proof of Theorem 4 we proceed as follows. Iterating the just 
described construction, we arrive at a chain of fields 


Reig Clyt ig... 
such that each nonconstant polynomial f € L,[X], n € N, admits a zero in 


Inii. The union 
b= |) oy 
n=0 


of this ascending chain of fields is itself a field, and we claim that L is alge- 
braically closed. Indeed, consider a nonconstant polynomial f € L[X]. There 
exists an index n € N satisfying f € L,[X], since f has only finitely many 
nonzero coefficients. Then f admits a zero in L,4, by our construction and 
thereby in L. This shows that L is algebraically closed, which completes our 
proof of Theorem 4. 


Let us point out that we actually have L = Ly in the setting of the above 
proof; cf. Exercise 10 in Section 3.7. However, to justify this fact we need some 
additional information that is not yet available at the present stage. 


Corollary 7. Let K be a field. Then there exists an algebraically closed field K 
extending K, where K is algebraic over K; such a field K is called an algebraic 
closure of K. 


Proof. If we look a bit closer at the construction of an algebraically closed field 
L extending a field K, as exercised in the proof of Theorem 4 above, we can 
easily realize that DL is algebraic over K and therefore admits the properties of 
an algebraic closure of A’. Indeed, by its construction, the extension Ly /Ly_, is 
generated by a family of algebraic elements so that L,/Ln_, is algebraic due to 
3.2/11. Then, using induction, it follows from 3.2/12 that all L,, are algebraic 
over K. Since L is the union of the L,, we see that L is algebraic over K. 

Alternatively, if Z is an arbitrary algebraically closed field extending A’, we 
can set 

K= fa € L; a is algebraic over Kh. 


Then K is a field and hence an algebraic extension field of K, since a, 3 € K 
implies K(a, 8) C K. Furthermore, K is algebraically closed. Indeed, consider 
a nonconstant polynomial f € K[X]. Since L is algebraically closed, f admits 
a zero y in L. This zero is algebraic over K, and by 3.2/12, algebraic over K, 
so that we get y € K. 
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As an example, we refer to the (not yet established) fact that C is an 
algebraic closure of R. Moreover, we can define an algebraic closure Q of Q by 
setting 

Q= {a € C; a is algebraic over Q}. 


Note that Q is different from C, since C contains elements such as e and 7 
that are transcendental and therefore not algebraic over Q. Alternatively, the 
inequality Q 4 C can be justified by means of a cardinality argument if we 
use the fact that the algebraic closure of a field is unique up to (noncanonical) 
isomorphism (cf. Corollary 10 below). Indeed, C is of uncountable cardinality, 
while the explicit construction of an algebraic closure of Q via the proof of 
Theorem 4 implies that Q is countable. 

Finally, we want to show that every two algebraic closures of a given field 
are isomorphic over kK, although in general, there will exist several isomorphisms 
of this type, none of them canonical. To settle this question we must study 
the problem of extending field homomorphisms kK —> L to algebraic field 
extensions K’/K. Let us add that the corresponding results of Lemma 8 and 
Proposition 9 below not only are of interest for the question on the uniqueness 
of algebraic closures, but also play an important role for the characterization of 
separable field extensions in 3.6, as well as for setting up Galois theory in 4.1. 

We still need a convenient notation for the transport of polynomials with 
respect to homomorphisms. If 0: kK —> L is a field homomorphism and 
K[X] —> L[X] the induced homomorphism on polynomial rings, we denote 
by f? € LLX] the image of a polynomial f € KX]. For every zero a € K of 
f, it is immediately clear that its image o(a) is a zero of f7. 


Lemma 8. Let K be a field and K' = K(a) a simple algebraic field extension 
of K with attached minimal polynomial f € K(X] of a. Furthermore, let 
a: K —> L be a field homomorphism. 

(i) If o': K' —+ L is a field homomorphism extending o, then o'(a) is a 
zero of f°’. 

(ii) Conversely, for every zero 8 € L of f? € LLX], there is precisely one 
extension o': K' —+ L of o such that o/(a) = 8. 

In particular, the different extensions o' of o are in one-to-one correspon- 
dence with the distinct zeros of f° in L, and the number of these is < deg f. 


Proof. For every extension o’: kK’ —+ L of o we get from f(a) = 0 necessarily 
f°(o'(a)) = o'(f(a)) = 0. Moreover, since K’ = K [a] by 3.2/9, every extension 
o’: K' —+ L of a is uniquely determined by the image o’(q) of a. 

It remains to show for each zero 6 € L of f? that there is an extension 
a’: K' —+ L of o satisfying o’(a) = 8. To do this, consider the substitution 
homomorphisms 


yg: K(X] > Kla],  g+> gla), 
w: K[X] —> tb, gt g9°(8). 
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We have (f) = kery by 3.2/5, as well as (f) C kerw, since f7(8) = 0. If 
a: K[X] —> K[X]/(f) denotes the canonical projection, we obtain via the 
fundamental theorem on homomorphisms 2.3/4 a commutative diagram 


K(X] 


K[a]-* K[X]/(f) = 1 


with homomorphisms G and 7 that are unique. Since G is an isomorphism, we 
recognize o’ := yo G+ as an extension of o satisfying o/(a) = B. 


Proposition 9. Let kK C K’ be an algebraic field extension and a: K —> L 
a field homomorphism with image in an algebraically closed field L. Then o 
admits an extension o': K' —+ L. In addition, if K' is algebraically closed and 
L algebraic over o(K), then every extension o' of o is an isomorphism. 


Proof. The main work was already done in Lemma 8, and it remains only to 
apply Zorn’s lemma. Let M be the set of all pairs (F,7) consisting of an in- 
termediate field F, kK C F C K’', as well as an extension 7: F’ —> L of oa. 
Then MM is partially ordered under the relation < if we write (F,r) < (£",7') 
when FC F” and r'|r =7 hold. Since (K,o) belongs to M, we see that M is 
not empty. Furthermore, using the standard union argument, we see that every 
totally ordered subset of MM admits an upper bound. Thus, the assumptions of 
Zorn’s lemma are met, and it follows that M contains a maximal element (FT). 
But then F = Kk’, since otherwise, we could fix an element a € K’ — F and 
extend 7 with the help of Lemma 8 to a homomorphism 7’: F'(a) —> L, con- 
tradicting the maximality of (F',7). In particular, the existence of the desired 
extension o’: kK’ —+ L of a is clear. 

If, in addition, K’ is algebraically closed, then the same is true for o’(K’). 
Furthermore, if Z is algebraic over o(/‘), it is algebraic over o’(KX’) as well, and 
we conclude that o’(A’) = L using Remark 3. Since field homomorphisms are 
always injective, a’ is an isomorphism. 


Corollary 10. Let K, and Ky be two algebraic closures of a field K. Then 
there exists an isomorphism K, + Ky, noncanonical in general, that extends 
the identity map on K. 


Exercises 


1. Let f € QLX] be a polynomial of degree > 1. Why is it less complicated to 
construct a zero of f within the “setting of algebra” than within the “setting of 
analysis” ? 

2. Why is it not possible to prove the existence of an algebraic closure K of a field K 
along the following lines: Consider all algebraic extensions of K and observe for 
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a family (K;)ier of such fields that is totally ordered with respect to the inclusion 
relation that also the union Ue, Ki is an algebraic extension of K. Therefore, 
Zorn’s lemma yields the existence of a maximal algebraic extension and thereby 
of an algebraic closure of K. 


3. Why should one make a difference between particular algebraic closures of a field 
K and avoid talking about “the” algebraic closure of K? 


4. Let K bea field and f € K[X] a polynomial of degree > 1. Show for a maximal 
ideal m C K[X] containing f that L = K[X]/m can be viewed as an algebraic 
extension field of K in which f admits a zero. Furthermore, show that D coincides 
with the extension field one obtains by applying Kronecker’s construction to a 
suitable irreducible factor of f. 


5. Let K be an algebraic closure of a field K. Show that K is countable if the same 
is true for K. 


6. Show that every algebraically closed field consists of infinitely many elements. 


7. Let L/K be a finite field extension of degree [L : K] = n. Assume there is an 
element a € L together with isomorphisms o;: L —> L, i =1,...,n, satisfying 
oilk = idx and o;(a) 4 o;(a) for i 4 j. Show that L = K(a). 


8. Let Q be an algebraic closure of Q. Determine all homomorphisms Q(v2 i) 3 Q 
as well as their images. 


3.5 Splitting Fields 


In the present section we start with some preparations that will be of particular 
interest later when we study Galois theory. As prerequisites we use the existence 
of algebraically closed fields, as established in Section 3.4, as well as the results 
3.4/8 and 3.4/9 on the extension of field homomorphisms. If L/AK and L'/K 
are two field extensions and 0: L —>+ L’ is a field homomorphism, we call o a 
K-homomorphism if it restricts to the identity map on K. 


Definition 1. Let ¥ = (fijier, fi € KX], be a family of nonconstant polyno- 
mials with coefficients in a field K. A splitting field (over K) of the family ¥ 
is a field L extending K such that: 
(i) Every polynomial f; decomposes into a product of linear factors over L. 
(ii) The extension L/K is generated by the zeros of the polynomials f;. 


In its simplest case, the family § consists of a single polynomial f € K[X]. 
Then, if K is an algebraic closure of K and a,,...,d@, are the zeros of f in K, 
it follows that L = K(a,...,a@,) is a splitting field of f over K. In a similar 
way, one shows that splitting fields exist for arbitrary families § of nonconstant 
polynomials f; € K[X]. Just choose an algebraic closure K of K and define L 
as the subfield of K that is generated over K by all zeros of the polynomials f;. 
If the family ¥ consists of only finitely many polynomials fi,..., f,, then every 
splitting field of the product fi-...- f, is a splitting field of F and vice versa. 
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Proposition 2. Let Ly, Ly be two splitting fields of a family § of noncon- 
stant polynomials in K[X], for a field K, and let Lz be an algebraic closure of 
Ly. Then every K-homomorphism o: L, — TL» restricts to a K-isomorphism 
o: Ly > Lp. 


In particular, since the inclusion K <> Ly extends to a K-homomorphism 
a: L, —> Lz by 3.4/9, it follows that LZ, is K-isomorphic to Ly. Therefore, we 
can state the following corollary: 


Corollary 3. Let L,, Ly be two splitting fields of a family of nonconstant 
polynomials in K|X]. Then there exists a K-isomorphism Ly > Ly. 


Proof of Proposition 2. First we consider the case in which § consists of a single 
polynomial f, which we may assume to be monic. Let a),...,a@, be the zeros of 
f in L, and bj,...,b, the zeros of f in Ly C Ly. Then we get 


f? =|] (X-a(a)) = [[(x - 4), 


and @ maps the set of the a; bijectively onto the set of the b;. Hence, we see 
that 


Ly = K(hy,...,bn) = K (a(a), or ,F(an)) = G(I4), 


i.e., © restricts to a K-isomorphism 0: L, —+ Ly, as claimed. 

The special case just dealt with settles the assertion of the proposition for 
finite families ¥ as well; just view L; and Lz as splitting fields of the product 
of all polynomials in ¥. Finally, the general case is derived by viewing L, and 
Ly as unions of splitting fields corresponding to finite subfamilies of ¥. 


We want to characterize the property of a field L being a splitting field of 
a family of polynomials in KX] by equivalent conditions and then introduce 
normal field extensions. 


Theorem 4. The following conditions are equivalent for a field K and an 
algebraic extension K C L: 
(i) Every K-homomorphism L —+ L into an algebraic closure L of L 
restricts to an automorphism of L. 
(ii) L is a splitting field of a family of polynomials in K(X]. 
(iii) Every irreducible polynomial in K[X] that admits a zero in L decom- 
poses over L completely into linear factors. 


Definition 5. An algebraic field extension K C L is called normal if it satisfies 
the equivalent conditions of Theorem 4. 


Proof of Theorem 4. We start with the implication from (i) to (iii) and consider 
an irreducible polynomial f € K[X] admitting a zero a € L. If b € L is 
another zero of f, we can conclude from 3.4/8 that there is a K-homomorphism 
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o: K(a) —> L satisfying o(a) = b. Furthermore, using 3.4/9, we can extend o 
to a K-homomorphism o’: L —+ L. Now, if condition (i) is given, we obtain 
o'(L) = Land thereby see that b = o’(a) € L. Hence, all zeros of f are contained 
in L, and f decomposes over L into a product of linear factors. 

Next we show that (iii) implies condition (ii). Let (a;)ier be a family of 
elements in L such that the field extension L/K is generated by the elements 
a;. Furthermore, let f; be the minimal polynomial of a; over K. Then, according 
to (iii), all f; decompose over L into a product of linear factors, and we see that 
L is a splitting field of the family ¥ = (fi)ier. 

Finally, assume condition (ii), i.e., that L is a splitting field of a family ¥ 
of polynomials in K[X]. If ¢: L —> L is a K-homomorphism, it follows that 
o(L), just like L, is a splitting field of ¥. However, then we must have o(L) = L, 
since both fields are subfields of L; see also Proposition 2. 


Every polynomial of degree 2 decomposes over a given field L into a product 
of linear factors, provided it admits a zero in L. Thus, we see from condition (ii) 
(or even (iii)) of Theorem 4 that field extensions of degree 2 are always normal. 
Furthermore, we can draw the following conclusion: 


Remark 6. If K C L C M is a chain of algebraic field extensions and if M/K 
is normal, then the same is true for M/L. 


Let us add that the property of a field extension being normal is not transi- 
tive, ie., for a chain of fields K C LC M such that L/K and M/E are normal, 
the extension M/K does not need to be normal. For example, Q(V2)/Q and 
Q(V2)/Q(V2) are normal extensions, since they are of degree 2, while the ex- 
tension Q(W2)/Q is not normal. Indeed, the polynomial X* — 2 is irreducible 
over Q and admits in Q(W2) the zero W2. On the other hand, X4 — 2 is not 
a product of linear factors over Q( V2), since the complex zero i v2 does not 
belong to Q(v/2) C R. 

It is often useful to know that every algebraic field extension L/K admits a 
normal closure, by which we mean an extension field L’ of L such that L’/L is 
algebraic and L’/K is normal with the property that there is no proper subfield 
of L’ satisfying these conditions. Thus, L’ is, so to speak, a minimal extension 
of EL such that L’/K is normal. 


Proposition 7. Let L/K be an algebraic field extension. 
(i) L/K admits a normal closure L'/K, where L' is unique up to (non- 
canonical) isomorphism over L. 

(ii) L//K is finite if L/K is finite. 

(iii) If M/L is an algebraic field extension such that M/K is normal, we 
can choose L' as an intermediate field of M/L. In this case, L' is unique. More 
precisely, if (o;)ier is the family of all K-homomorphisms from L to M, then 
L' = K(o;(L);i € I). We call L’ the normal closure of L in M. 
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Proof. Assume L = K (2), where 2 = (a,;)jey is a family of elements in L. Let 
f; be the minimal polynomial of a; over K. If M is an algebraic extension field 
of LZ such that M/K is normal (for example, choose for M an algebraic closure 
of L), then it follows from Theorem 4 (iii) that the polynomials f; decompose 
in M[X] into a product of linear factors. Now let L’ be the subfield of M that 
is generated over KX by the zeros of the f;. Then L’ is defined as a splitting 
field of the f;. Furthermore, we have L C L’ C M, and it is clear that L’/K is 
a normal closure of L/. Also we see that L’/K is finite if L/K is finite. On 
the other hand, if L’/K is a normal closure of L/K, then the field L’ contains 
necessarily a splitting field of the f; and thus, due to the minimality condition, 
is a splitting field of the f; over K. 

To establish the uniqueness assertion, consider two normal closures Li /K 
and L4/K of L/K. As we have just seen, Li and L‘ are splitting fields of the 
polynomials f; over Kk and hence also splitting fields of the f; over L. Then 
Corollary 3 yields the existence of an L-isomorphism L', —> L4, which implies 
the uniqueness assertion of (i) and, in conjunction with Theorem 4 (i), also the 
uniqueness assertion of (iii). 

Finally, to derive the explicit characterization of L’ as given in (iii), consider 
a K-homomorphism 0: L —+ M. By 3.4/8 it maps the zeros of the f; to zeros 
of the same type. Since L’ is generated over K by these zeros, we see that 
K(o;(£);7 € I) C L’. Conversely, for every zero a € L’ of one of the polynomials 
f; we can define, due to 3.4/8 again, a K-homomorphism K(a;) —> L’ such 
that a; +» a. This can be extended via 3.4/9 to a K-automorphism of an 
algebraic closure of L’ and subsequently be restricted to a K-homomorphism 
a: L —+ L’, using the normality of L’/K. Thus, a € K(o;(L);i € I), and the 
equality L’ = K(a;(L);i € I) is clear. 


Exercises 
1. Give a detailed justification for the fact that field extensions of degree 2 are always 
normal. 


2. Let L/K be a field extension, where L is a splitting field of a nonconstant polyno- 
mial f € K[X]. Explain once again why every irreducible polynomial of K[X] 
admitting a zero in L will decompose over L into a product of linear factors. 


3. Let K be a field and L a splitting field of the family of all nonconstant polynomials 
in K[X]. Show that L is an algebraic closure of K. 


4. Show for a finite field extension L/K that condition (i) in Theorem 4 is equivalent 
to the following one: 


(i!) Every K-homomorphism L —> L’ into a finite field extension L’ of L re- 
stricts to an automorphism of L. 


Replace condition (i) of Theorem 4 by (i’) and sketch a proof of this theorem for 
finite extensions L/J that avoids the existence of an algebraic closure of K. 


5. Consider L = Q(¥2,i) as a field extending Q. 
(i) Show that L is a splitting field of the polynomial X* — 2 € Q[X]. 
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(ii) Determine the degree of L over Q, as well as all Q-automorphisms of L. 
(iii) Show that L = Q(V2 +2) using Exercise 7 of Section 3.4. 


6. Determine a splitting field L of the polynomial X4 + 2X? — 2 over Q, as well as 
the degree [L : Q]. 


7. Check whether the field extension Q (v 2+ v2) /Q is normal. 


8. Let K bea field and f € K[X] a polynomial of degree n > 0. Let L be a splitting 
field of f over K. Show: 
(i) [L: K] divides nl. 
(ii) If [L: K] =n!, then f is irreducible. 
9. Determine a splitting field L over Q of the family {X 441, X54 2} and compute 
the degree [L : Q]. 
10. Consider f = X®°—7X*4+3X?+3 as a polynomial in Q(X], as well as in F)3[X]. 


In either case, decompose f into its irreducible factors and determine a splitting 
field of f over Q, resp. Fis. 


11. Let K(a)/K and K(8)/K be simple algebraic field extensions with minimal 
polynomials f of a, resp. g of 8, over K. Show that f is irreducible over K(() 
if and only if g is irreducible over K(a). Furthermore, show that f and g are 
irreducible if deg f and deg g are prime to each other. 


12. Let L/K be a normal algebraic field extension and f € K[X] a monic irreducible 
polynomial. Let f = f1...f- be a prime factorization of f in L[X] with monic 
factors f;. Show for every two factors fj, f;,7 4 j, that there is a K-automorphism 
a: L —>+ [ satisfying f; = f7. 


a 


13. Let L/K and L’/K be normal algebraic field extensions and let L” be a field 
containing L and L’ as subfields. 


(i) Show that (ZN L’)/K is a normal algebraic field extension. 


(ii) Use (i) to give an alternative proof of Proposition 7. 
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When looking at polynomials f € K[X] over a field K, it is convenient to 
consider their zeros in an algebraic closure K of K. Assertions on the zeros of 
polynomials f are quite often independent of the choice of K, since such an 
algebraic closure is unique up to K-isomorphism. For example, it is meaningful 
to say that f has only simple zeros, or that f has multiple zeros. Nonconstant 
polynomials with only simple zeros, i.e., zeros of multiplicity 1, are called sep- 
arable. 


Lemma 1. Let K be a field and f € K[X] a nonconstant polynomial. 
(i) The multiple zeros of f (in an algebraic closure K of K) coincide with 
the common zeros of f and its derivative f', or equivalently, with the zeros of 


gcd(f, f’). 
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(ii) If f ts irreducible, it has multiple zeros if and only if its derivative f' is 
identically zero. 


Proof. Assertion (i) is a consequence of 2.6/3, at least when KC is algebraically 
closed. (In Section 2.6, zeros of polynomials f € K [|X] were always consid- 
ered in K, not yet in an algebraic closure of AK.) To settle the general case 
in which kK is not necessarily algebraically closed, it is enough to point out 
that a greatest common divisor d = gcd(f, f’) in K[X] is at the same time 
also a greatest common divisor of f and f’ in K[X]. To justify this, use the 
ideal-theoretic characterization of the greatest common divisor in principal ideal 
domains 2.4/13. Indeed, from the equation d- K[X] = f: K[X] + f’: KLX] 
we conclude that d- K[X] = f-K[X]+ f’- K[X], ie., we have d= gcd(f, f’) 
in K[X] as well as in K[X]. 

To verify (ii), assume that f is irreducible and, in addition, monic. Then, 
if a € K is a zero of f, we recognize f as the minimal polynomial of a over 
K. Furthermore, we know from (i) that a is a multiple zero of f if and only if 
a is a zero of f’ as well. However, since deg f’ < deg f and f is the minimal 
polynomial of a over K, it is clear that jf’ can vanish at a only if it equals the 
zero polynomial. 


A nonconstant polynomial f € AK [LX] has always a nontrivial derivative 
f’ £ 0 if char K = 0. Therefore, assertion (ii) of the lemma implies that irre- 
ducible polynomials are separable in characteristic 0. On the other hand, there 
are irreducible polynomials in characteristic > 0 that are not separable. For 
example, let p be a prime number, ¢ a variable, and consider the function field 
K =F,(t) = Q(F,[t]). Then X? — t, as a polynomial in K[X], is irreducible 
by Eisenstein’s criterion 2.8/1, but not separable, since f’ = pX?~! = 0. In the 
following, let us look more closely at the case of positive characteristic. 


Proposition 2. Let K be a field and f € K[X] an irreducible polynomial. 

(i) If char K = 0, then f is separable. 

(ii) If char K = p> 0, choose r € N maximal such that f is a polynomial in 
X?", i.e., such that there is a polynomial g € K[X] satisfying f(X) = g(X”’). 
Then every zero of f is of multiplicity p" and g is an irreducible polynomial 
that is separable. The zeros of f equal the p"th roots of the zeros of g. 


Proof. The case char kK = 0 was discussed before, so assume char K = p > 0. 
Furthermore, write 


ye f= igx™. 
i=0 i=1 


Then f’ = 0 is equivalent to ic; = 0 for i = 1,...,n. Since ic; vanishes precisely 
when we have p|i or c¢; = 0, we conclude that f’ is the zero polynomial precisely 
when there exists some h € K[X] such that f(X) = h(X°*). 
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Now assume f(X) = g(X°”’), as specified in assertion (ii). If we apply the 
above reasoning to g instead of f, we get g’ # 0 due to the maximality of 
r. Furthermore, g is irreducible, since the same is true for f. Therefore, g is 
separable by Lemma 1 (ii). Now choose an algebraic closure K of K and consider 
a factorization 

9 =|] (X - 4), a; € K, 
where we have assumed f and therefore also g to be monic. Introducing p'th 
roots c; € K such that & = a;, we obtain 


f= Tx" - 2) =T]x-a" 


a 


using 3.1/3, and thereby see that all zeros of f are of multiplicity p’. 


Next we want to define the notion of separability for algebraic field exten- 
sions. 


Definition 3. Let K C L be an algebraic field extension. An element a € L is 
called separable over K if a is a zero of a separable polynomial of KX], or 
equivalently, if the minimal polynomial of a over K is separable. The field L is 
called separable over K if every element a € L is separable over K. 


A field K is called perfect if every algebraic extension of K is separable. For 
example, we can deduce from Proposition 2 (i) the following: 


Remark 4. Every algebraic field extension in characteristic 0 is separable. In 
particular, all fields of characteristic 0 are perfect. 


For a prime number p and a variable t we have already seen that the poly- 
nomial X? —t € F,(t)[X] is irreducible, but not separable. Therefore, the field 
'(t) [|X ]/(X? — t) is not separable over F,(¢). Equivalently, we can state that 
the algebraic field extension F,,(t)/F,(#?) fails to be separable, since X? — ??, 
as an irreducible polynomial in F,(¢?)_X], is the minimal polynomial of t over 
“(t?. 

In the following we want to characterize separable algebraic field extensions 
more thoroughly. In particular, we want to show that an algebraic field extension 
is separable as soon as it is generated by separable elements. To achieve this, 
we need the notion of the separable degree as a replacement of the usual degree, 
which was used in studying general algebraic field extensions. 


Definition 5. For an algebraic field extension K Cc L, let us denote by 
Homg(L, K) the set of all K-homomorphisms from L into an algebraic clo- 
sure K of K. Then the separable degree of L over K, denoted by [L: K],, is 
defined as the number of elements in Home (L, K), i.e., 


[L: K],:= #Homx(L, K). 
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It follows from 3.4/10 that the separable degree of an algebraic extension 
L/K is independent of the choice of an algebraic closure K of K. Let us compute 
the separable degree for simple algebraic field extensions. 


Lemma 6. Let K C L = K(a) be a simple algebraic field extension with 
minimal polynomial f € K[X] of a over K. 
(i) The separable degree [L: K], equals the number of different zeros of f 
in an algebraic closure of K. 
(ii) The element a is separable over K if and only if [L: K] =[L: K],. 
(iii) Assume char K = p> 0 and let p" be the multiplicity of the zero a of f 
(cf. Proposition 2 (ii)). Then [L: K] = p"[L: K]s. 


Proof. Assertion (i) is a reformulation of 3.4/8. To justify (ii), let n = deg f. 
Then a is separable if and only if f does not admit multiple zeros and hence has 
n distinct zeros, or according to (i), if and only ifn = [L : K],. However, due to 
3.2/6 we have [L : K] = deg f = n, and it follows that a is separable precisely 
when [L : K] = [L: K],. Finally, assertion (iii) is a direct consequence of 


Proposition 2 (ii). 


To handle the separable degree of more general algebraic field extensions 
we need an analogue of the multiplicativity formula 3.2/2. 


Proposition 7 (Multiplicativity formula). Let K C LC M be algebraic field 
extensions. Then 


[M:K],=[M:L],-[L: K]s. 


Proof. Fix an algebraic closure K of M. Then K C LC M C K, and we may 
view K also as an algebraic closure of K and of L. Furthermore, let 


Homx(L, K) = {o;; i € Th, Hom;(M, K) = {7;; 7 € J}, 


where in each case, the o; as well as the 7; are distinct. Now extend the 
K-homomorphisms o;: L —> K via 3.4/9 to K-automorphisms o;: K —> K. 
The desired multiplicativity formula will then be a consequence of the following 
two assertions: 


(1) The maps G;07;: M —> K,i€ I, j € J, are distinct. 
(2) Homg(M, K) = {G,07;; i € 1,7 € Jt. 


To verify assertion (1), consider an equation of type 7; 07; = Tj 0 7;. Since 
7, and 7; restrict to the identity on L, we can conclude that o; = oj and hence 
i = 7. The latter implies 7; = 7,, and thus 7 = j’. It follows that the maps 
specified in (1) are distinct. Since they are K-homomorphisms, it remains to 
show for (2) that every K-homomorphism +: M —> K is as specified in (1). 
For tr € Homx(M, K) we have r|, € Homg(L, K). Hence, there exists an index 
i € I such that t|, = o;. Then we obtain 7! 07 € Hom,(M, XK), and there 
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exists an index j € J such that o;'! 07 = 7;. Therefore, 7 = 3; 0 7; and (2) is 
clear. 


If we take into account that algebraic field extensions are separable in char- 
acteristic 0 (Remark 4), we can apply the multiplicativity formulas of 3.2/2 
and Proposition 7 inductively and thereby derive from Lemma 6 the following 
result: 


Proposition 8. Let K C L be a finite field extension. 
(i) If char K =0, then [L: K] =[L: K]g. 
(ii) If char K =p > 0, then [L: K] =p"[L: K], for some exponent r € N. 
In particular, 1<[L:K],<[L: K], and [L: Kk], divides [L: K]. 


We are now able to characterize finite separable field extensions in terms of 
the separable degree. 


Theorem 9. For a finite field extension K C L the following conditions are 
equivalent: 
(i) L/K is separable. 
(ii) There exist elements a,,...,@n € L that are separable over K and satisfy 
L= K(a,...,Gn). 
(iii) (LD: K)],=[L: K]. 


Proof. The implication from (i) to (ii) is trivial. If a € L is separable over K, 
then the same is true over every intermediate field of L/K. Therefore, using the 
multiplicativity formulas in 3.2/2 and in Proposition 7, the implication from 
(ii) to (iii) can be reduced to the case of a simple field extension. However, that 
case was already dealt with in Lemma 6 (ii). 

It remains to show that (iii) implies (i). Consider an element a € L with its 
minimal polynomial f € AK[X] over K. To show that a is separable over K, 
which amounts to showing that f admits only simple zeros, we are reduced to 
the case char K = p > 0, due to Remark 4. Then, by Proposition 2 (ii), there is 
an exponent r € N such that every zero of f is of multiplicity p". Hence, we get 

[K(a): K] =p"-|K(a): K], 
from Lemma 6. Using the multiplicativity formulas of 3.2/2 and Proposition 7 
in conjunction with the estimate between the degree and the separable degree 
in Proposition 8, we obtain 


[L: K] =[L: K(a)] - [K(a) : K] 


Now, if [L: K], = [L: K], we must have r = 0. Then all zeros of f are simple 
and a is separable over A’, which shows that (iii) implies condition (i). 
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Corollary 10. Let kK C L be an algebraic field extension and 2A a family of 
elements in L such that L/K is generated by 2. Then the following conditions 
are equivalent: 

(i) L/K is separable. 

(ii) Every a € 2 is separable over K. 

Each of these conditions implies [L: K] =[L: K]s. 


Proof. Every a € L is contained in a subfield of type K(a1,...,@n), where 
@4,.-+,4n € A. In this way, the equivalence between (i) and (ii) is a direct 
consequence of Theorem 9. Furthermore, for L/K finite separable, we conclude 
that [L : K] =[L: K],, again by Theorem 9. Now let L/K be separable while 
[L : K] = oo. Then every intermediate field E of L/K is separable over 
as well, so that we obtain [E: K] = [E: K], for [E : K] < oo and hence 
[L: K], > [E: K] using the multiplicativity formula of Proposition 7. Since 
there exist intermediate fields F of L/K of arbitrarily large degree, we see that 
[L: K],=oo=[L: kK]. 


Corollary 11. Let K C LC M be algebraic field extensions. Then M/K is 
separable if and only if M/L and L/K are separable. 


Proof. We have only to show that the separability of M/Z and L/K implies the 
separability of M/K. Fix an element a € M with minimal polynomial f € L[X] 
over L. Furthermore, let L’ be the intermediate field of L/K that is generated 
over K by the coefficients of f. Since M/L is separable, we conclude that f is 
separable. Therefore, L’(a)/L’ is separable, and the same is true for L’/K, since 
L/K is separable. Moreover, L'(a)/L’ and L’/K are finite, and we see using the 
multiplicativity formulas that 


[L’(a) : K], = [L’(a) : i; . [L’ : K], 
= (Boer) <(e k= (iia). k), 


This shows that L’(a), and in particular a, are separable over K. 


Finally, we want to prove the primitive element theorem, which asserts that 
finite separable field extensions are simple. 


Proposition 12. Let L/K be a finite field extension, say L = K(a,,...,4;), 
and assume that the elements do,...,a, are separable over K. Then the ex- 
tension L/K admits a primitive element, 7.e., an element a € L such that 
L = K(a). In particular, every finite separable field extension admits a primi- 
tive element. 


Proof. Let us first consider the case that K consists of only finitely many ele- 
ments. Then, since [L : kK] < co, also Lis finite. In particular, the multiplicative 
group L* is finite and hence cyclic, as we will show below in Proposition 14. 
Any element a € L generating L* as a cyclic group will also generate the field 
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extension L/IX, and we see that L/K is simple. Note that this argument does 
not use any separability assumption on the extension L/K. However, L/K is 
automatically separable if K is a finite field, as we will see in 3.8/4. 

It remains to look at the case that K consists of infinitely many elements. 
Using an induction argument we may assume r = 2, i.e., that L is generated over 
K by two elements a and 6 with b being separable over K. Write n = [L: K], 
and let o1,...,0n be the distinct elements of Homy(L, K), where as usual, 
is an algebraic closure of K’. Then consider the polynomial 


P=] (ila) - o5(a)) + (o1(6) — 05(0)) -X]. 


tAj 


We claim that P € K[X] is not the zero polynomial. Indeed, for i 4 j we must 
have o;(a) 4 0;(a) or o;(b) € 0;(b), since otherwise o; would coincide with o; 
on L = K[a,b]. Since K contains infinitely many elements, but P can admit 
only finitely many zeros, there must exist an element c € K satisfying P(c) 4 0. 
This implies that the elements 


o;(a) + co;(b) =0,(a+ cb) € K, i=1,...,n, 


are distinct and hence that L’ = K(a+ cb) is a simple algebraic field extension 
of Kk of separable degree 


[Laks = [Le K|s. 


Now L’ C L implies [L’: K], < [L: K], and therefore [L’: K], =[L: K]s. 
Let us show that we have, in fact, L’ = L and hence that L/K is simple. 
By our assumption, b € L is separable over K and hence also over L’, so 
that [L/(b) : L’], = [L’(b) : L’]. Furthermore, the multiplicativity formula of 
Proposition 7 yields 


[L: K],>[V®):K]=(/@:1),[“: K],= (7:1) (Lb: K],. 


This shows that [L’(b) : L'] = 1 and hence L’(b) = L’. Therefore, b belongs to 
L' = K(a+cb), and the same is true for a. In particular, we get L = K(a,b) = Ll’, 
and L is a simple field extension of K. 


It remains to verify that the multiplicative group of a finite field is cyclic. 
To do this we need an auxiliary result from group theory. 


Lemma 13. Let a and b be two elements of finite order in an abelian group G, 
say orda = m and ordb =n. Then there exists an element of order lem(m,n) 
in G. 

More precisely, choose integer decompositions m = mgm', n = non’, where 
lem(m,n) = mono and gcd(mo,no) = 1. Then a™b” is an element of order 
Iem(m,n). In particular, ab is of order mn if m and n are prime to each other. 
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Proof. Let us first assume that m and n are prime to each other, and show 
that ab is of order mn. Clearly, we have (ab) = (a™)"(b")™ = 1. On the 
other hand, the equation (ab)' = 1 implies a” = a™b™ = 1, and we get m|t, 
since gcd(m,n) = 1. Similarly, it follows that n|t and hence mn | t, so that 
ord(ab) = mn. 

In the general case, choose decompositions m = mgm’, n = non’ such that 
Iem(m,n) = mono and ged(mo,no) = 1. To achieve this, consider a prime 
factorization pj’ -...- per of lem(m,n) and define mo as the product of all 
prime powers p;* dividing m, as well as no as the product of all prime powers 
p;' not dividing m. Then we get mo | m as well as no | n, and the resulting 
decompositions m = mgm’, n = non’ clearly satisfy lem(m,n) = mono, as well 
as gcd(mpo, 0) = 1. 

Now, since a’ is of order mo and b” is of order no, the order of a™b”™ 
equals mong by the special case considered before. 


Proposition 14. Let K be a field and H a finite subgroup of the multiplicative 
group K*. Then H is cyclic. 


Proof. Fix an element a € H of maximal order m and let H,, be the subgroup 
of all elements in H whose order divides m. Then all elements of H,, are zeros 
of the polynomial X”™ — 1, so that H,, can contain at most m elements. On 
the other hand, H,, contains the cyclic group (a) generated by a, whose order 
is m. Therefore, we must have H,, = (a), and H,, is cyclic. We claim that in 
fact, H = H,,. Indeed, if there existed an element b € H not belonging to H,, 
and hence whose order n did not divide m, then H would contain an element 
of order lem(m,n) > m, due to Lemma 13. However, this is in contradiction to 
the choice of a. 


Exercises 


1. Recall the proof of the following fact: For an algebraic field extension L/K and 
two elements a,b € L that are separable over K, their sum a-+ b is separable 
over K as well. More thoroughly, show that the set of all elements in L that are 
separable over K yields an intermediate field of L/K. It is called the separable 
closure of K in L. 


2. Let K be a field and f € K[X] a nonconstant polynomial. Why is the assertion 
that f has multiple zeros in an algebraic closure K of K independent of the choice 
of K? 


3. The proof of Proposition 12 yields a practical method for determining primitive 
elements of finite separable field extensions. Give a sketch of this process. 


4. Let K C LC M be algebraic field extensions such that M/K is normal. Show 
that [L: K], = #Homx(L, M). 


5. For a prime number p consider the function field L = F,(X,Y) in two variables 
over F,, as well as the Frobenius homomorphism o: L —> L, a +> a?. Let 
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K = o(L) be the image of o. Determine the degrees [L : K] and [L: K],, and 
show that the field extension L/K is not simple. 


6. Let L/K be a field extension in characteristic p > 0 and consider an element 
a € L that is algebraic over AK. Show that a is separable over K if and only if 
K(a) = K(a?). 

7. An algebraic field extension L/K is simple if and only if it admits only finitely 
many intermediate fields. Prove this assertion using the following steps: 

(i) First discuss the case that K is a finite field, so that after this, K can be 
assumed to be infinite. 

(ii) Assume L = K(a) and let f € KX] be the minimal polynomial of a over 
kx. Show that the set of intermediate fields of L/K can be identified with a 
subset of the set of divisors of f in LLX]. 

(iii) Assume that L/K admits only finitely many intermediate fields. To show 
that L/K is simple, reduce to the case that L is generated over K by two 
elements a, 3. Finally, if L = K(a, 8), consider fields of type K(a +c) for 
constants c € K. 


8. Let K bea finite field. Show that the product of all elements in K* equals —1. As 
an application, deduce for prime numbers p the divisibility relation p|((p—1)!+1). 


3.7 Purely Inseparable Field Extensions 


In the last section we introduced the separable degree [L : K], for algebraic 
field extensions L/K and saw that 1 < [L: K], < [L: K]; cf. 3.6/8. In 
particular, we proved that a finite extension L/K is separable if and only if 
[L : K], = [L: K]. Next we turn to the opposite situation and consider 
algebraic field extensions L/K satisfying [L : K], = 1. Since algebraic field 
extensions are separable in characteristic 0, we assume in the following that 
is a field of characteristic p > 0. 

A polynomial f € K[X] is called purely inseparable if it admits precisely 
one zero a (in an algebraic closure K of K’). Since the attached minimal poly- 
nomial m, € K[X] divides f, we conclude by induction on the degree of f 
that f, assuming it is monic, is a power of m, and thereby a power of an ir- 
reducible purely inseparable monic polynomial. Furthermore, if h € KX] is 
such a monic irreducible polynomial that is purely inseparable, we see using 
3.1/3 and 3.6/2 (ii) that h is of type X?" — c for some n € N and some c€ K. 
Conversely, it is clear that all polynomials of this type are purely inseparable. 
Hence, the monic purely inseparable polynomials in K [|X] consist of precisely 
all powers of polynomials of type X?" — c. 


Definition 1. Let K C L be an algebraic field extension. An element a € L is 
called purely inseparable over K if a is a zero of a purely inseparable polynomial 
in K(X], or equivalently, if the minimal polynomial of a over K is of type 
XP" —¢ forn €N andc€ K. Furthermore, L is called purely inseparable over 
K if every element a € L is purely inseparable over K in the sense just defined. 
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It is immediately clear from the definition that purely inseparable field ex- 
tensions are normal. The trivial extension K’/K is the only algebraic field exten- 
sion that is separable and purely inseparable at the same time. Also note that 
the extension F,,(t)/F,(t”) of the preceding section is an example of a nontrivial 
purely inseparable field extension. 


Proposition 2. For an algebraic field extension K C L, the following conditions 
are equivalent: 
(i) L is purely inseparable over K. 
(ii) There exists a family 2 = (ai)ier of elements in L such that L = K (Ql) 
and each a; 1s purely inseparable over K. 
(iii) (LD: K],=1. 
(iv) For every element a € L, there is an integer n € N such that a?” € K. 


Proof. The implication from (i) to (ii) is trivial. Next, to derive (iii) from (ii) 
it is enough to show that [A(a;) : A], = 1 for all i € J. The reason is that 
any K-homomorphism L —+ K into an algebraic closure K of K is uniquely 
determined by the images of the elements a;. On the other hand, the minimal 
polynomial of each element a; is of type X?" — c and therefore admits only a 
single zero in K. Hence, we get [K(a;) : K], =1 by 3.4/8, as desired. 

Now let us assume condition (iii) and derive (iv) from it. Choosing an ele- 
ment a € L, we get 


[L: K(a)],- [K(a): K], = [ee | = 


and thereby [K(a) : K], = 1. This means that the minimal polynomial of 
a over K admits only a single zero and hence by 3.6/2, that it is of type 
xX" — c, But then we conclude that a?” € K, as required in (iv). Finally, as- 
sume a?" € K for some a € L and hence that a is a zero of a polynomial of type 
X?" —c € K[X], which is a polynomial admitting only a single zero. Then the 
minimal polynomial of a over K is of the same type, and we see that a is purely 
inseparable over kK. This shows that (iv) implies (i). 


Corollary 3. Let K C LC M be algebraic field extensions. Then M/K is 
purely inseparable if and only if M/L and L/K are purely inseparable. 


Proof. [M:K],=[M:1L],-[L: K],; cf. 3.6/7. 


Next, we want to show that every algebraic field extension can be decom- 
posed into a separable extension, followed by a purely inseparable extension. 
For normal extensions, this is possible in reverse order as well. 


Proposition 4. Let L/K be an algebraic field extension. Then there exists a 
unique intermediate field K, of L/K such that L/K, is purely inseparable and 
K,/K is separable. The field K, is called the separable closure of K in L, i.e., 


k= {a € L; a separable over K}, 


3.7 Purely Inseparable Field Extensions 121 


and we have [L: K], = [K,: K]. If L/K is normal, the ertension K,/K is 


normal, too. 


Proposition 5. Let L/K be a normal algebraic field extension. Then there 
exists a unique intermediate field K; of L/K such that L/K; is separable and 
K;/Kis purely inseparable. 


Proof of Proposition 4. We write 
K,= {a € L; a separable over K}. 


Then Ky, is a field. Indeed, for a,b € K, we see from 3.6/9 that K(a,b) is a 
separable extension of K, so that K(a,b) C K,. Therefore, K, is the biggest 
separable extension of K that is contained in Z. Now consider an element a € L 
and let f € K,[X] be the minimal polynomial of a over K,. Then, by 3.6/2, 
there exists a separable polynomial g € K,[X] such that f(X) = g(X”’) for 
some exponent 7 € N. Moreover, g is irreducible, since f is irreducible. It follows 
that g is the minimal polynomial of c = a?” over Ky, and that c is separable over 
K,, hence by 3.6/11 also separable over K’. However, then we must have c € K, 
and therefore g = X — c, as well as f = X”" — c. Thus, a is purely inseparable 
over K,, and the same is true for L over Ks. 

Since L/K, is purely inseparable and K,/K is separable, we get the stated 
relation on degrees 


[L:K],=[L:K.].-[Ks: K]s = [Ke: K]. 


To justify the uniqueness of K,, consider an intermediate field K’ of L/K such 
that L/K’ is purely inseparable and K’/K is separable. Then we have A’ C K, 
by the definition of K,, and the extension K,/K’ is separable. On the other 
hand, the latter extension is purely inseparable, since L/K’ is purely insepara- 
ble. This shows that K,/K’ is trivial and hence that A, is unique, as claimed. 

It remains to show that K,/K is normal if the same is true for L/K. To 
do this, consider a K-homomorphism o: K, —+ L into an algebraic closure L 
of L. Since we can view L as an algebraic closure of K as well, we can extend 
o due to 3.4/9 to a K-homomorphism o’: L —+ L. Now, assuming L/K to be 
normal, o’ restricts to a K-automorphism of L. Furthermore, the uniqueness 
of K, implies that o restricts to a K-automorphism of K’,, and it follows that 
K,/K is normal. 


Proof of Proposition 5. Since the extension L/K is assumed to be normal, we 
can identify the set of K-homomorphisms of L into an algebraic closure L of DL 
with the set of K-automorphisms of L, the latter forming a group G. Let 


K;, = {a€ L; o(a) =a for alla € G} 


be the subset in Z that is left invariant under the members of G; it is easily 
verified that K; is a field, the so-called fixed field attached to G. Using 3.4/9, 
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every K-homomorphism K; —> L extends to a K-homomorphism L —> L. 
Since such a homomorphism leaves K; fixed due to its definition, we can conclude 
that # Hom, (K;, L) = 1. Hence, viewing L as an algebraic closure of K, we see 
that K;/K is purely inseparable. Indeed, the equivalence between (i) and (iii) 
in Proposition 2 shows that K; is the biggest purely inseparable extension of 
contained in L. To see that L/K; is separable, consider an element a € L, as 
well as a maximal system of elements o1,...,0, € G such that o;(a),...,0,(a) 
are distinct. Such a finite system will always exist, even if G is not finite, since 
a(a), for o € G, is a zero of the minimal polynomial of a over K. Also note 
that a will necessarily appear among the elements o;(a). Since every o € G 
induces a bijective self-map on the set {o1(a),...,0,-(a)}, we can conclude that 
the polynomial 


has coefficients in K;, since these are fixed by the elements of G. In particular, 
a is a zero of a separable polynomial in K;[X], and we see that a is separable 
over K;. Letting a vary over all of L, it follows that L/K; is separable. Finally, 
the uniqueness of K; is deduced, similarly as in Proposition 4, from the fact 
that K; is the biggest intermediate field of L/K that is purely inseparable over 
K. 


Exercises 


1. Let L/K be a field extension, and let a,b € L be elements that are purely insep- 
arable over K. Give an explicit argument showing that a+b and a-b are purely 
inseparable over K as well. 

2. Looking at a finite field extension L/K, its inseparable degree could be defined 
by [L: K]; = [L : K]-[L: K];z!. Discuss the drawbacks of this notion in 
comparison to the separable degree when formulating and proving the results of 
the present section on purely inseparable field extensions. 


3. Consider a simple algebraic field extension L/K and give a direct argument for 
the fact known from Proposition 4 that there exists an intermediate field K, such 
that L/K, is purely inseparable and K,/K is separable. 


4. Let K be a field of characteristic p > 0. Show that the Frobenius homomorphism 
a: K —> K, a+ > a’, is surjective if and only if K is perfect. 


5. Let L/K bea field extension and let a € L be separable over K, as well as 8 € L 
purely inseparable over kK. Show: 
(i) K(a,8) = K(a+ 8), 
(ii) K(a,8) =K(a- 8) ifax0#B. 
6. Let K be a field of characteristic p > 0. Show: 


(i) Given n € N there exists a field K?-" extending K with the following 
properties: If a € K? ", then a?” € K, and for every element b € K there 
exists an element a € K? " such that a?” = b. 
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(ii) The field K? ” is unique up to canonical isomorphism, and there are canon- 
ical embeddings K C KP C KP’ c.... 


(ii) APO SE KP is a perfect field. 
The field K? ~ is called the purely inseparable closure of K. 
7. Let L/K be an algebraic field extension. Show: 
(i) If K is perfect, the same is true for L. 
(ii) If L is perfect and L/K is finite, then K is perfect. 


Give an example showing that assertion (ii) will fail to be true in general if the 
finiteness of L/K is not assumed. 


8. Let L/K be a separable algebraic field extension. Show that the following condi- 
tions are equivalent: 


(i) Every nonconstant separable polynomial in L[X] admits a factorization 
into linear factors. 
(ii) Choosing an algebraic closure K of K and a K-embedding L © K, the 
extension K/L is purely inseparable. 
Show for a field k that there always exists an extension field L = Kgep satisfying 


the preceding conditions and that it is unique up to (noncanonical) isomorphism 
over K. The field Kgep is called a separable algebraic closure of K. 


9. Let L/K be a normal algebraic field extension in characteristic > 0. Consider 
the intermediate fields K, and K; as specified in Propositions 4 and 5. Show that 
L= K,(Kji) = Ki(Ks). 

10. Let L/K be an algebraic field extension with the property that every irreducible 
polynomial in A LX] admits at least one zero in L. Show that L is an algebraic 
closure of K. 


3.8 Finite Fields 


We are already familiar with the finite fields of type F, = Z/pZ for prime num- 
bers p; these are precisely the prime fields of characteristic > 0; cf. 3.1/2. In 
the following we want to construct for every nontrivial prime power q of p, say 
q =p” for some n > 0, a field F, consisting of g elements. However, note that 
such a field will be totally different from the residue class ring Z/p"Z for n > 1, 
since the latter admits nontrivial zero divisors and thus cannot be a field. 


Lemma 1. Let F be a finite field. Then p = charF > 0, and F contains F, 
as its prime subfield. Moreover, F consists of precisely q = p" elements, where 
n= [F: F,]. In addition, F is a splitting field of the polynomial X41 — X over 
F,, and it follows that the extension F/F, is normal. 


Proof. Since F is finite, the same is true for its prime subfield. Hence, the latter is 
of type F,,, where p = char F > 0. Furthermore, the finiteness of F shows that the 
degree n = [F : F,] is finite, and we see, for example using an F,-vector space 
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isomorphism F —~> (F,)", that F consists of g = p” elements. In particular, the 
multiplicative group F* is of order gq — 1. Therefore, every element in F* is a 
zero of the polynomial X71! — 1, and it follows that every element of F is a zero 
of the polynomial X4— X. Thus, all in all, F consists of g = p” zeros of X4—X 
and thereby of all zeros of this polynomial. It follows that X%— X factorizes 
over F into a product of linear factors, and we can conclude that F is a splitting 
field of the polynomial X? — X € F,[X]. 


Theorem 2. Let p be a prime number. For every integer n € N — {0} there 
exists an extension field F,/F, consisting of q = p"” elements. Furthermore, up 
to isomorphism, Fy ts uniquely characterized as a splitting field of the polynomial 
X?— X over F,. In fact, the elements of F, are recognized as the q different 
zeros of X1— X., 

Every finite field of characteristic p is isomorphic to precisely one of the 
finite fields of type Fy. 


Proof. Write f = X%— X. Since f’ = —1, the polynomial f does not admit 
multiple zeros and therefore has g simple zeros in an algebraic closure F,, of F,. 


If a,b € F, are two zeros of f, the binomial formula 3.1/3, 


(atb)?=al tb! =atb, 


implies that a+ 6 is a zero of f. Moreover, for b 4 0 the equation 
(ab~')4 = a2(b2)~! = ab 


shows that ab~! is a zero of f as well. Thereby we see that the q zeros of f 
in F, form a field consisting of q elements. In fact, it is a splitting field of f 
over F,, (constructed as a subfield of F,). This verifies the existence of a field 
of characteristic p consisting of g = p” elements. Finally, the stated uniqueness 
assertions can be derived from Lemma 1. 


In the following we fix a prime number p. In dealing with finite fields of 
characteristic p > 0, it is common practice to choose an algebraic closure F, of 
F,, and to view the fields F,» for n € N—{0} as subfields of F,, using 3.4/9. 
Since F,» is normal over F,,, we conclude from 3.5/4 (i) that F,», as a subfield 
of F,, is unique. 


Corollary 3. Embed the fields F, for q = p", n € N — {0}, into an algebraic 
closure F,, of F,. Then an inclusion F, C Fy holds for q = p" and q! = p”™ 


if and only if n|n'. Furthermore, the extensions of type F, C Fy are the only 
extensions between finite fields of characteristic p, up to isomorphism. 


Proof. Assume F, C Fy and let m = [Fy : Fy]. Then 


p” = #F = (#F,)” == ge 


3.8 Finite Fields 125 


and we see that n|n’. Conversely, if n’ = mn, consider an element a € Fy, 
ie., an element a € F, satisfying a? = a. Using a recursive argument, we 
obtain a” = a”) = (a2)@"™) = af”) = a and hence F, C Fy. That there 
cannot exist any further extensions between finite fields of characteristic p, up 
to isomorphism, follows from the extension result 3.4/9. Indeed, if F C F’ is an 
extension of finite fields in characteristic p, we can extend the inclusion F, C F, 
to a homomorphism F —> F, and that to a homomorphism F’ —> F,, so that 
up to isomorphism, we are reduced to the case F C F’ C F,. 


Corollary 4. Every algebraic extension of a finite field is normal and separable. 
In particular, finite fields are perfect. 


Proof. Let F C K be an algebraic field extension, where F is finite. If K is finite 
as well, say K = F, for q = p”, then K is a splitting field of the separable 
polynomial X%— X and therefore is normal and separable over F,, resp. F. If 
K is infinite, we can view it as a union of finite extensions of F. 


We have already seen in 3.6/14 that the multiplicative group of a finite field 
is cyclic. Therefore we can state the following: 


Proposition 5. Let q be a power of a prime number. Then the multiplicative 
group of Fy is cyclic of order q—1. 


Finally, let us look at a finite extension F,/F, of degree n and determine the 
automorphism group Autg,(F,), or in other words, the corresponding Galois 
group, as we will say in the next chapter. To do this, assume gq = p’ and 
qd =q" =p", and fix an algebraic closure F, of F,. Then 


Autg, (Fy) = Home, (Fy, F,), 
since F,/F, is normal, and furthermore, 
# Aute, (Fy) = [Fy : Fy]. = [Fy : Fy] =n, 
since F,/F, is separable. Now consider the Frobenius homomorphism 
a: Fy — Fy, ara, 


of Fy that was introduced in 3.1; concerning the additivity of a see 3.1/3. 
The rth power a” leaves F, invariant and is referred to as the relative Frobe- 
nius homomorphism on Fy over Fy. We claim that it is of order n. Indeed, 
o” € Autr, (Fy) is of order < n, since a?) = a for all a € Fy. On the other 
hand, if we had ordo” < n and hence e := orda < rn, then all elements 
a € Fy would be zeros of the polynomial X) — X, contradicting the fact that 
#Fy =p’ > p®. Therefore, it follows that Autp,(F,) is cyclic of order n, gen- 
erated by the relative Frobenius homomorphism o”. In particular, taking into 
account Corollary 3, we obtain the following result: 
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Proposition 6. Let F, be a finite field, q = p"’, and F/F, a finite field exten- 
sion of degree n. Then Autg,(F) is cyclic of order n, generated by the relative 
Frobenius homomorphism F —> F, at— a’. 


Exercises 


1. Explain why the extensions of type F,(t)/Fp(t?), for p prime and t a variable, are, 
so to speak, the “simplest” examples of field extensions that are not separable. 


2. Let F and F’ be subfields of a field L. Explain why we will have F =F’ if F and 
F’ are finite and consist of the same number of elements. 


3. Show for a prime number p and n € N—{0}: 
(i) An irreducible polynomial f € F,[X] is a divisor of XP" — X if and only if 
deg f is a divisor of n. 


(ii) X?" —X € F,[X] equals the product over all irreducible monic polynomials 
f € F,[X] such that deg f divides n. 


4, Show that Fp2% = U2, F,m is an algebraic closure of Fy. 


5. Let F, be an algebraic closure of F,,. Show that besides the powers of the Frobe- 
nius himionanep bien, there exist ee eee of F,. Hint: For prime 
numbers £ study the automorphisms of U?- Fy,, where q, = fy : 


3.9 Beginnings of Algebraic Geometry* 


So far we have looked at zeros of polynomials in one variable. Now we want to 
study zeros of polynomials in several variables with coefficients in a field K, and 
thereby have a first look at the interesting domain of algebraic geometry; for 
more details, consult [3] or any other book on the subject. As the name suggests, 
algebraic geometry applies abstract algebraic methods in a geometric setting. 
This is related to the fact that zero sets of polynomials in several variables are 
usually not finite and that it is a truly demanding venture to discover their 
structures. 

In the following, let X = (X1,..., Xn) be a system of variables and let K be 
an algebraic closure of the field K under consideration. For an arbitrary subset 
EF of the polynomial ring K[X] = K[X1,...,Xn], we can look at the set 


V(E) = {xe K"; f(x) =0 for all f € E} 


of common zeros in K” of the polynomials belonging to E; note that V(E) is 
referred to as an algebraic subset of K” that is defined over K. Conversely, 
given a subset U C Kk”, we can consider its corresponding ideal 


I(U) ={f € K[X]; f(U) =0} 


of all polynomials f vanishing on U. That I(U) is indeed an ideal in K [LX] is 
easily verified. Furthermore, we have V(£) = V(a) for a the ideal generated by 
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FE in K[X]; use the fact that a consists of all finite sums of type }> fje;, where 
fi € K[X] and e; € E. Let us list some elementary properties of the mappings 
V(-) and I(-): 


Lemma 1. For ideals ay, a2, resp. a family (a;)icr of ideals in K[X], as well 
as for subsets U,,U, CK”, we have: 
(i) ay C ag = > V(a,) D V(ay). 
(ii) U, Cc Up => I(U;) =) I(U2). 
(iii) Vase i) =(),V (a). 
(iv) V(a1- a2) = V(a1 Nae) = V(a1) UV (a2). 


Proof. The assertions (i), (ii), and (iii) are easy to verify; we show only how to 
obtain (iv). Since 
a-d2 Ca,Mag C aj, a= 1,2, 


we conclude from (i) that 
V(a1 + a2) D V(a, M ag) D V(ay) UV (ap). 


On the other hand, consider a point  € K" —(V(a,) UV(az)). Since « ¢ V (aj) 
for i = 1,2, there exists in each case an element f; € a; such that f;(x) 4 0. 
Using the fact that fi f2 belongs to a, + a2 and (fi f2)(x) = fila) - fo(a) does not 
vanish, we get x ¢ V(a,- a2). This shows that 


V(ay . a2) Cc V(a1) U V (ag), 


thereby justifying assertion (iv). 


The main objective of the present section is to discuss some of the deeper 
results on the mappings V(-) and J(-). To begin with, we want to show for 
every subset E C K[X] that there exist finitely many elements f,,..., f, € E 
satisfying V(E) = V(fi,..., f-). This means that every algebraic subset of K” 
that is defined over K can be viewed as the zero set of finitely many polynomials 
in K[X]. To justify this it is enough to prove that the ideal a that is generated 
by FE in K[X] is finitely generated. A ring with the property that all its ideals 
are finitely generated is called a Noetherian ring. 


Theorem 2 (Hilbert’s basis theorem). Let R be a Noetherian ring. Then the 
polynomial ring R[Y] in a variable Y is Noetherian as well. In particular, the 
polynomial ring KLX] = K[X1,...,Xn] in finitely many variables over a field 
K is Noetherian. 


In 2.4/8, a ring R was called Noetherian if every ascending chain of ideals 
a, C a2 C... C R becomes stationary after finitely many steps. Let us first 
show that this condition is equivalent to the fact that every ideal of R is finitely 
generated. Indeed, given a chain as specified before, we can consider the union 
a = LU, a; as an ideal in R. If a admits a finite system of generators f,..., f,, 
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then all f, and hence also a are contained in one of the ideals a;. Consequently, 
the chain of ideals becomes stationary at this position. Conversely, consider an 
ideal a C R that is not finitely generated. Then, for finitely many elements 
fi,---> fr € a we always have (f1,...,f,) # a. Thus, using an inductive con- 
struction, there exists in a an infinite strictly ascending chain of ideals. 


Proof of Theorem 2. Let R be a Noetherian ring and a C R[Y] an ideal. For 
iz © N define a; C R as the set of all elements a € R such that there exists a 
polynomial of type 


aY' + terms of lower degree in Y 


in a. It is verified without difficulty that every a; is an ideal in R and that we 
obtain an ascending chain 


acac...cR, 


since f € aimplies Y f € a. Now, using the fact that R is Noetherian, the chain 
becomes stationary, say at the position of the ideal a;,. Then, for 7 = 0,...,%0, 
choose polynomials f;; € a of degree deg f;; = i such that for fixed 7 the leading 
coefficients a;; of the f;; generate the ideal a;. We claim that the polynomials 
fi; will generate the ideal a. To justify this, consider a polynomial g € a, where 
we may assume g # 0. Furthermore, let a € R be the leading coefficient of g 
and write d = degg as well as i = min{d, io}. Then we get a € aj, and hence 
there is an equation 
a= Scag, Cj E R. 
j 


Next, observe that the polynomial 


gn =g-Yo*- C5 fij 
j 

belongs to a again and that its degree is strictly less than the degree d of g, 
since the coefficient of Y“ has become trivial now. If g, 4 0, we can apply the 
same process to g; in place of g again, and so forth. After finitely many steps 
we arrive at a polynomial g, that is zero, and the process stops. It follows that 
g is a linear combination of the f;; with coefficients in R[Y], and we see that 
the f,; generate a. 


Given an ideal a of a ring R, we can always consider its radical 
rada = {a € R; there exists some n € N such that a” € a}. 


Using the binomial formula, it is easily seen that the radical of a is itself an 
ideal in R. Ideals with the property that a = rada are called reduced. For every 
subset U C K", its corresponding ideal I(U) C K[X] is reduced. Indeed, a 
polynomial f € KX] vanishes at a point 2 € K” if and only if some power f’, 
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where r > 0, vanishes at x. We want to have a closer look at the relationship 
between ideals in A LX] and algebraic sets in K”. 


Proposition 3. The mappings I(-) and V(-) define mutually inverse and 
inclusion-inverting bijections 


{algebraic subsets C K"} — {reduced ideals C K[X]}, 
V 


where, in more precise terms, on the left-hand side we mean algebraic subsets 


of K" that are defined over K. 


To carry out the proof we have to establish the equations 
V(I(U)) =U, I(V(a)) =a, 


for algebraic subsets U Cc K” and reduced ideals a C K[X]. The first 
of these is of elementary nature. To justify it, assume U = V(a) for some 
ideal a C K[X]. We then have to show that V([(V(a))) = V(a). Since 
all polynomials in a vanish on V(a), we see that a C I(V(a)) and therefore 
V(a) D V(I(V(a))). On the other hand, all polynomials in [(V(a)) vanish on 
V(a), so that V(a) C V(I(V(a))) and hence V(I(V(a))) = V(a). The second 
equation [(V(a)) = a is more involved; it is a special case of a result referred 
to as Hilbert’s Nullstellensatz (German for Hilbert’s theorem on the zero locus 
of polynomials): 


Theorem 4 (Hilbert’s Nullstellensatz). Let a be an ideal of the polynomial ring 
K[X] = K[X%,...,Xn] and let V(a) be the corresponding zero set in K”. 
Then I(V(a)) = rada. In other words, a polynomial f € K[X] vanishes on 
V(a) precisely when a power f" belongs to a. 


We start by proving a lemma known as a weak version of Hilbert’s Nullstel- 
lensatz. 


Lemma 5. Let A= K[a1,...,%n] #0 be a ring of finite type over a field K. 
Then the inclusion K + K can be extended to a K-homomorphism A — K. 


Proof. For a maximal ideal m C A, consider the canonical map K —> A/m. 
Since A/m is a field that is of finite type over K in the ring-theoretic sense, we 
conclude from 3.3/8 that A/m is finite over AK. Then 3.4/9 shows that there 
exists a K-homomorphism A/m—> K, and the composition of the latter with 
the projection A —+ A/m yields the desired K-homomorphism from A to K. 


Now we come to the proof of Theorem 4. Since all polynomials of a vanish 
on V(a), we obtain a C I(V(a)) and even rada C I(V(a)), since all ideals of 
type I(U) are reduced. To derive the opposite inclusion we proceed indirectly. 
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Assume that there is a polynomial f € [(V(a)) such that f” ¢ a for all r EN. 
Then the multiplicative system S = {1, f, f?,...} is disjoint from a. Using Zorn’s 
lemma 3.4/5 (or alternatively, the fact that K [LX] is Noetherian), there exists 
an ideal p C KX] that is maximal among all ideals q C K[X] such that a C q 
and qn S =. We claim that p is a prime ideal. Indeed, choose a,b € K[X]—p. 
By the choice of p, the ideals (a,p) and (b,p) that are generated by a and p, 
resp. b and p, in K[X] must have a nonempty intersection with S,, so that 


SN (ab,p) > SN ((a, p) . (b, p)) # i). 


In particular, ab ¢ p and p is a prime ideal. 

Next, consider the residue class ring A = K[X]/p as a ring extension 
of finite type of K. Let f € A be the residue class of f. Since f ¢ p 
by the choice of p and since A is an integral domain, we can consider the 
subring A[f~!] of the field of fractions Q(A). Using Lemma 5, there is a 
K-homomorphism A[ f Tt) — K, and composing it with canonical maps, we 
arrive at a K-homomorphism 


yp: K[X] + Ao A[f] > K. 


Now look at the point z = (y(X1),...,¢(Xn)) € K”. We may view the map y 
as the substitution homomorphism evaluating polynomials of A LX] at «. Since 
acpcC kery, it follows that x € V(a). On the other hand, f(x) = ¢(f) is 
nonzero, since it is the image of the unit f € A[f~!]. However, f(x) 4 0 for 
a point « € V(a) contradicts the fact that f € [(V(a)) by our choice of f. In 
particular, the assumption that there is no power of f belonging to a cannot be 
maintained, and we see that [(V(a)) C rada. 


For an algebraically closed field K, the algebraic subsets of kK” correspond- 
ing to maximal ideals m C K[X] are particularly simple to describe, since they 
are reduced to the one-point sets in K”: 


Corollary 6. Let K be an algebraically closed field. An ideal m of the polynomial 
ring K(X] = K[X,...,X,] is maximal if and only if there exists a point 
u = (X41,...,Un) € K” such that m = (Xy — 2,...,Xn — £n). In particular, 
V(m) = {x} and I(x) =m in this case. 

Therefore, if K is algebraically closed, the maximal ideals in KX] corre- 
spond bijectively to the points of K” under the bijection mentioned in Proposi- 
tion 3. 


Proof. First note that (X1,..., Xn) C K LX] is a maximal ideal, since the residue 
class ring A [X]/(X1,..., Xn) is isomorphic to K. In the same way, we see that 
ideals of type (X1 — 71,...,Xn—%n) C K[X] for points « = (2,...,2) in 
K” are maximal as well; just use a K-automorphism on K[X] transforming 
the variables X,,...,X, to X, — %,...,Xn — Zn. Now consider an arbitrary 
maximal ideal m C K[X]. Due to Lemma 5, there is a A-homomorphism 
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K[X]/m — K, where the latter is necessarily an isomorphism, since K LX ]/m 
is a field extending AK. Composing this isomorphism with the canonical projec- 
tion KLX] — K[X]/m, we arrive at an epimorphism KX] —> K admitting 
m as its kernel. For 2 = 1,...,n, let x; € K be the corresponding image of X;. 
Then we get X; — x; € m for all 7, and we see that m must coincide with the 
ideal (X, —21,..., Xn—n), since that ideal is already maximal in K [|X]. This 
establishes the desired characterization of maximal ideals in A [LX], while the 
remaining assertions are easily deduced from this one. 


More generally, one can show for a not necessarily algebraically closed field 
K that an ideal in K [|X] is maximal if and only if it is of type [({x}) for a point 
x € K”; cf. Exercise 2. However, the set {x} does not necessarily form an alge- 
braic subset of K” that is defined over K, and furthermore, x is in general not 
uniquely determined by the corresponding maximal ideal [({2}) C AK[LX]. For 
example, every K-automorphism 0: K —+ K maps the point x = (a1,...,%n) 
to a point o(z) := (o(21),...,0(@n)) satisfying I({x2}) = I({o(x)}). The small- 
est algebraic subset of K" containing x and defined over K is V(I{x}), which 
one can show consists of all points o(x) for 7 varying over the K-automorphisms 
of K. 

Now consider an ideal a C K LX] and its associated algebraic set V(a) C K”. 
Viewing polynomials of K [X] as K-valued functions on K", we can restrict this 
domain to the algebraic set V(a). This restriction process gives rise to a ring 
homomorphism K[X] —+ Map(V(a), A’) whose kernel contains a. In particu- 
lar, the elements of the residue class ring K[X]/a may canonically be viewed 
as “functions” on V(a); the ring K [|X ]/a is referred to as the ring of polynomial 
functions (modulo a) on the algebraic set V (a). Proceeding like this, a little bit 
of care is necessary, since the map K[X]/a —+ Map(V (a), ’) will not be injec- 
tive in general. For example, nilpotent elements in K [X]/a give rise to the zero 
function on V(a), and one can deduce from Hilbert’s Nullstellensatz that these 
are the only elements in K|X]/a with this property. Indeed, the kernel of the 
map K[X] —> Map(V(a), K) is given by the ideal rada, which implies that 
the kernel of the induced homomorphism K[X]/a —> Map(V(a), K) equals 
the radical of the zero ideal in Kk [.X]/a, which consists of all nilpotent elements 
in K[X]/a. 


Exercises 


Let K be a field, K an algebraic closure of K, and X = (Xj,..., Xn) a system 
of variables. 


1. For subsets EC K[X] and U C K” set 


V«(E) = {x € K”; f(z) =0 for all f € E}, 
1(U) = (f € K[X1; f(U) = 0}. 
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Review the results of the present section and examine which of them remain valid, 
and which not, if we consider zeros of polynomials f € K[X] merely in K” and 
not in K", in other words, if we use the mapping Vix(-) instead of V(-). 


. Consider the substitution homomorphism h,: K[X] —> K, f -> f(z), for 


elements 2 € K”. Show that the ideals of type ker h, are precisely the maximal 
ideals of K LX]. 
Let m Cc A [X] be a maximal ideal. Show that m = (f1,..., fn) for polynomials 


fi,---,fn, where f; for each 7 is a monic polynomial in X; with coefficients in 
KLX,..., X¢-1]. 


Let U C K” bean algebraic subset defined over K. Then U is said to be irreducible 
over K if there does not exist a decomposition U = U,U U2 for algebraic subsets 
U,,U2 ¢ U that are defined over K. Show: 


(i) U C K” is irreducible over K if and only if the corresponding ideal I(U) is 
prime in K[X]. 

(ii) There exists a decomposition U = Uy U...UU, of U into algebraic subsets 
that are defined and irreducible over kK. For decompositions that cannot be 
shortened, the U;,...,U;, are unique, up to numeration. 


Let A be a K-algebra of finite type. Show that A is a Jacobson ring, i.e., that 
every reduced ideal a ¢ A is an intersection of maximal ideals. 


4. Galois Theory ®) 


Check for 
updates 


Background and Overview 


In Chapter 3 we saw that every field K admits an algebraic closure K and 
that this closure is unique up to K-isomorphism. Hence, given an algebraic 
equation f(a) = 0 for a nonconstant polynomial f € K[X], we know that f 
factorizes over K into linear factors. In particular, K contains “all” solutions 
of the algebraic equation f(x) = 0. The subfield L C K generated over K by 
these solutions is a splitting field of f, and the extension L/K is finite, as well 
as normal; see 3.5/5. Alternatively, a splitting field Z of f can be obtained in 
terms of Kronecker’s construction, by successively adjoining all solutions of the 
equation f(x) = 0 to K. If we want to clarify the “nature” of the solutions, for 
example, if we want to see whether we can solve the equation by radicals, the 
structure of the extension L/K has to be studied. 

At this point Galois theory comes into play, with its group-theoretic meth- 
ods. At the center of interest is the group Aut,(L) of all K-automorphisms 
of L. If L/K is separable and thus a Galois extension, the group Aut,(L) is 
referred to as the Galois group of L/K and is denoted by Gal(L/K). Every 
K-automorphism L —+ LF restricts to a bijective self-map on the zero set of f 
and, in fact, is uniquely determined by the images of these zeros. Therefore, the 
elements of Aut; (Z) can readily be identified with the corresponding permuta- 
tions on the zero set of f. Interpreting K as an algebraic closure of L, we can just 
as well view Auty(L) as the set of all K-homomorphisms L —+ K; see 3.5/4. 
Also note that the results 3.4/8 and 3.4/9 provide an explicit description of 
those homomorphisms. For example, let us assume that f does not admit mul- 
tiple zeros, or more generally that L, as a splitting field of f, is separable over 
K. Then the extension L/K is simple by the primitive element theorem 3.6/12, 
say L = K(qa), and the minimal polynomial g € K[X] of a factorizes over 
L into linear factors, due to 3.5/4. The corresponding zeros a1,...,Q@, € L 
satisfy L = K(a;), and there is a unique automorphism o; € Aut, (Z) such 
that o;(a@) = a; for each 7; see 3.4/8. This automorphism is characterized by 
h(a) ++ h(a;) for polynomials h € K[X]. It follows that the Galois group 
Gal(L/K) consists of the elements o1,...,0, where their number n equals the 
degree of g, resp. the degree of the extension L/K. Such an explicit description 
of automorphism groups Autx(L) was discovered by Galois himself, and it is 
for this reason that these groups are referred to as Galois groups. 
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The first basic result we will prove in the present chapter is the so-called 
fundamental theorem of Galois theory; see Section 4.1. It asserts for a finite 
Galois extension L/K that the subgroups H of the corresponding Galois group 
Gal(L/K’) correspond bijectively to the intermediate fields E of L/K via the 
mappings H ++ L#, resp. E ++ Autg(L); here L” denotes the subfield 
consisting of all elements in L that are invariant under the automorphisms 
belonging to H. Furthermore, an intermediate field EF of L/K is normal over 
if and only if Autg(Z), as a subgroup of Gal(L/K), is normal. This is only one 
facet of the quite general fact that the Galois group Gal(L/K’) encodes several 
properties of the extension L/K. In particular, the problem to determine all 
intermediate fields of L/K simplifies to that of determining all subgroups of 
Gal(L/K). 

In Section 4.2 we generalize the fundamental theorem of Galois theory to 
Galois extensions that are not necessarily finite. To do this we interpret Ga- 
lois groups as topological groups following W. Krull, and look especially at 
their closed subgroups. As an example, we determine the absolute Galois group 
Gal(F/F) of a finite field F, where F is an algebraic closure of F. Next, in 4.3, 
some examples are considered showing how the Galois group of algebraic equa- 
tions can be worked out in special cases. Also we prove at this place that the 
generic equation of degree n admits the full permutation group G,, as its Galois 
group. Related considerations lead us to the fundamental theorem on symmetric 
polynomials, whose more advanced version is derived in 4.4. As an application 
we study the discriminant of a polynomial f, which is nonzero precisely when f 
does not admit multiple zeros. Also we consider the resultant of two polynomials 
as a possible means for computing discriminants. 


The purpose of Sections 4.5—4.8 is essentially to prepare the characterization 
of the solvability of algebraic equations by radicals, although a final treatment 
of this subject has to be postponed until Chapter 6. In 4.5 and 4.8 we study 
so-called radical extensions, i.e., extensions that occur by adjoining solutions 
of pure equations of type x” — c = 0. For c = 1 this concerns nth roots of 
unity, i.e., nth roots of 1. In the remaining cases, we are dealing with cyclic 
extensions, i.e., Galois extensions with cyclic Galois group, provided we assume 
that the coefficient field AK contains all nth roots of unity. Certain modifications 
are necessary if the characteristic of the field AK under consideration divides 
n. As an auxiliary tool, we prove the theorem on the linear independence of 
characters in 4.6 and study subsequently, in 4.7, the norm and trace for finite 
field extensions. E. Artin based his set-up of Galois theory on techniques of 
this kind, see [1] and [2], while we have preferred to follow a more conventional 
approach in Section 4.1. 

In Sections 4.9 and 4.10 we generalize the characterization of cyclic exten- 
sions to certain classes of abelian extensions, called Kummer extensions of some 
given exponent n, named after E. Kummer. First, in 4.9, we assume that the 
characteristic of the field under consideration does not divide n; this is the easi- 
est case. Then, in 4.10, we explain Kummer theory from a more axiomatic point 
of view, applying it to study Kummer extensions of exponents p” over fields of 
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characteristic p > 0. As a necessary technical tool, we explain the formalism of 
Witt vectors, which was introduced by E. Witt. 

The chapter ends in 4.11 with an example of descent theory. Considering a 
finite Galois extension L/K, its aim is to describe K-vector spaces in the style 
of the fundamental theorem of Galois theory as fixed spaces of L-vector spaces 
that are equipped with a certain action of the Galois group Gal(L/K). 


4.1 Galois Extensions 


An algebraic field extension L/K is called normal, see 3.5, if L is a splitting 
field of a family of polynomials in K[X], or equivalently, if every irreducible 
polynomial in K[X] having a zero in L decomposes over L into a product of 
linear factors; cf. 3.5/4 (ii) and (iii). In the sequel the remaining characterizing 
property of normal extensions 3.5/4 (i) will play a fundamental role, namely 
that on choosing an algebraic closure L of L, every K-homomorphism L —> I 
restricts to an automorphism of L. Then, viewing LE as an algebraic closure K 
of K, the set Homx(L, K) of all K-homomorphisms of L to K can be identified 
with the group Autx(Z) of all K-automorphisms of L. Let us add along the 
way that two elements a,b € L are said to be conjugate (over K) if there exists 
an automorphism o € Aut;,(Z) such that o(a) = b; however, we will use such 
terminology only on rare occasions. 


Definition 1. An algebraic field extension L/K is called Galois if it is normal 
and separable. Then Gal(L/K) := Autx(ZL) is called the Galois group of the 
Galois extension L/K. 


In the literature, normal field extensions are also referred to as quasi- Galois 
extensions. Looking at a splitting field of a separable polynomial over a field 
kK, we obtain an example of a finite Galois extension. Furthermore, as we saw 
in 3.8/4, every algebraic extension F/F, of a finite field F,, where q is a prime 
power, is Galois. If F/F, is finite, the corresponding Galois group Gal(F/F,) 
is cyclic of order n = [F : F,] and is generated by the relative Frobenius 
homomorphism F —> F, a +> a‘; cf. 3.8/6. 


Remark 2. Let L/K be a Galois extension and E an intermediate field. Then: 
(i) The extension L/E is Galois and the Galois group Gal(L/E) is naturally 
a subgroup of Gal(L/Kx). 
(ii) If E/K is Galois as well, then every K-automorphism of L restricts 
to a K-automorphism of E and Gal(L/K) —> Gal(E/K), ¢ -> o|p, is a 


surjective group homomorphism. 


Proof. It follows from 3.5/6 and 3.6/11 that the extension L/E is Galois. Since 
each E-automorphism of L is at the same time a K-automorphism, we recognize 
Gal(L/F) as a subgroup of Gal(L/). Furthermore, if E/K is Galois, then ev- 
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ery K-automorphism of L restricts to a K-automorphism of FE, due to 3.5/4 (i). 
Thereby one obtains a group homomorphism Gal(L/K) —> Gal(£/K), which 
is surjective by 3.4/9 due to the fact that L/K is normal. 


Combining the defining properties of the separable degree with the results 
3.6/8 and 3.6/9, we have the following: 


Remark 3. Let L/K be a finite normal field extension. Then 
ord Auth (L) = [L: kK], <[L: K]. 


In particular, ord Autk(L) = [L : K] is equivalent to the fact that L/K is 
separable. 


As it will turn out, a fundamental property of Galois extensions L/K con- 
sists in the fact that A is the invariant or fixed field of the Galois group 
Gal(L/K), ie., K equals the set of all elements of L that are left invariant 
under all automorphisms in Gal(L/K). To prove this assertion, which is part 
of the fundamental theorem of Galois theory, we start by studying fixed fields 
that are constructed with respect to the action of automorphism groups. 


Proposition 4. Let L be a field and G a subgroup of Aut(L), the group of 
automorphisms of L. Furthermore, consider 


K=L° = {a€L; o(a) =a for alla € G}, 


the fixed field attached to G. 

(i) If G is finite, then L/K is a finite Galois extension, in fact, of degree 
[L: K] =ordG and with Galois group Gal(L/K) = G. 

(ii) If G is infinite, but L/K is known to be algebraic, then L/K is an infinite 
Galois extension with Galois group Gal(L/K) containing G as a subgroup. 


Proof. First, it is easily checked that K = L© is indeed a subfield of L. Now 
assume that G is finite, or if not, that L/K is algebraic. To see that L/K is 
separable algebraic, consider an element a € L, as well as a maximal system 
of elements o1,...,0, € G such that o\(a),...,¢,(a) are distinct. Such a finite 
system exists always, also in the case that G is infinite and the extension L/K 
is known to be algebraic. Indeed, in the latter case we see for each o € G that 
a(a) is a zero of the minimal polynomial of a over K. Also observe that the 
element a itself will occur among the o;(a). Every o € G gives rise to a self-map 
on the set {o1(a),...,0,-(a)} that is necessarily bijective, and it follows that the 
polynomial 


,= I] _ o;(a)) 


has coefficients in A’, since these are left invariant by G. In particular, a is a 
zero of a separable polynomial in KX] and hence is separable algebraic over 
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Kk. The same is then true for the extension L//’. Furthermore, L/K is normal, 
since L is a splitting field over K of all polynomials f of the type just considered. 
Thereby we see that L/K is a Galois extension. 

Now let n = ordG, where n = o9 is not excluded. Then the argumenta- 
tion given before shows that [K(a) : K] <n for every a € L. This implies 
[L: K] <n if we apply the primitive element theorem 3.6/12 to subfields of 
L that are finite over K. Since G is a subgroup of Aut, (ZL) = Gal(L/K), we 
deduce from Remark 3 that 


n=ordG < ord Gal(L/K) <[L:K]<n 


and therefore that ordG = [L: Kk]. In addition, for n < co we can conclude 
that G = Gal(L/K). 


Corollary 5. Let L/K be a normal algebraic field extension with automorphism 
group G = Aut(L). Then: 
(i) L/L® is a Galois extension with Galois group G. 
(ii) If L/K is separable and therefore Galois, then L° = K. 
(iii) Assume char K > 0. Then L& is purely inseparable over K, and the 
chain K C L© CL coincides with the chain K C K; C L of 3.7/5. 


Proof. We know by Proposition 4 that L/L° is a Galois extension. The corre- 
sponding Galois group coincides with G in this case, since Aut;¢(L) = Autx(L). 
Furthermore, the definition of L? shows that [L° : K], = 1. Indeed, if K is an 
algebraic closure of K containing L, then using 3.4/9, every K-homomorphism 
L°& —+ K extends to a K-homomorphism L —+ K or, since L/K is normal, 
to a K-automorphism of L. However, all K-automorphisms of F are trivial on 
L°. Now if L/K is separable, the same is true for L°/K, and we get L° = K, 
since [L° : K] = [L° : K], = 1. On the other hand, if L/K is not separable 
(for char K > 0), we see from 3.7/2 that L°/K is purely inseparable. That the 
chain K C L° C L coincides with the one of 3.7/5 follows from the uniqueness 
assertion in 3.7/5. 


Theorem 6 (Fundamental theorem of Galois theory). Let L/K be a finite Galois 
extension with Galois group G = Gal(L/K). Then the maps 


{subgroups of G } {intermediate fields of L/ Kh, 


H ——> L# 


Gal(L/E) ~———— E, 


? 


that assign to a subgroup H C G the fixed field L", resp. to an intermediate 
field E of L/K the Galois group of the Galois extension L/E, are bijective and 
mutually inverse. 

The fixed field L® is normal and therefore Galois over K if and only if H is 


a normal subgroup of G. In the latter case, the surjective group homomorphism 
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G —+ Gal(L"/K), 
o> olpe, 


admits H as its kernel and hence induces an isomorphism 


G/H + Gal(L"/K). 


Remark 7. [f in Theorem 6 we do not require that the Galois extension L/K 
be finite, we still get BoW = id; in particular, ® is surjective and W injective. 
However, for a subgroup H C G, its image (Wo@)(H) will in general be different 
from H itself; cf. 4.2/3 or 4.2/4. 

The second part of Theorem 6 remains valid for arbitrary Galois extensions 
if we restrict ourselves to subgroups H C Gal(L/K) satisfying the condition that 
(Wo &)(H) = H, or equivalently, that H = Gal(L/L"). These are the closed 
subgroups of Gal(L/Ix); cf. 4.2. 


Proof of Theorem 6 and Remark 7. Let L/K be a Galois extension that is not 
necessarily finite. If F is an intermediate field of L/K, then L/E is Galois and 
the Galois group H = Gal(L/F) is a subgroup of G = Gal(L/k); cf. Remark 2. 
Using Corollary 5 (ii), we get E = L”, so that 6o W = id, as claimed. No 
finiteness condition on L/K is used for this argument. Now consider a subgroup 
H CG and look at the intermediate field E = L” of L/K. If G is finite, the 
same is true for H, and we obtain H = Gal(L/E) from Proposition 4, or in 
other words, Yo @ = id. In particular, @ and W are bijective and mutually 
inverse to each other. 

Next consider a subgroup H C G and assume in view of Remark 7 that 
H = Gal(L/L"); this is automatically the case if L/K is a finite Galois ex- 
tension, as we just have seen. If L“/K is normal, there is a surjective group 
homomorphism 


yp: G— Gal(L"/K), 
or>olpy, 


by Remark 2. In particular, ker y consists of all AK-automorphisms of LD leav- 
ing L” fixed, ie., kerry = Gal(L/L"”) = H. Being the kernel of a group ho- 
momorphism, H is a normal subgroup in G and y induces an isomorphism 
G/H + Gal(L"/K) by the fundamental theorem on homomorphisms 1.2/7. 

Conversely, assume that H is a normal subgroup in G. Choosing an algebraic 
closure L of L, it is at the same time an algebraic closure of K and of L”. To 
see that L” /K is normal, consider a K-homomorphism o: L” —+ LF and let us 
show that ¢(L”) = L". To do this extend o to a K-homomorphism o’: L —> T, 
using 3.4/9. Since L/K is normal, o' restricts to an automorphism of L, and we 
can view o as a K-homomorphism L” —> L. Now let b € o(L”), say b = o(a) 
for some a € L”. To check that b € L”, we have to show that b is fixed by all 
automorphisms of H. Therefore, let 7 € H. Using Ho = oH, due to the fact 
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that H is a normal subgroup of G, there exists an element 7’ € H such that 
Tog =00T"'. Hence, we get 


7(b) = Toa(a) =a07T'(a) =a(a) =b, 


ie., be L”, since a € L". It follows that o(L”) Cc L”. Furthermore, extending 
o-!: o(L®) —> L¥ to a K-homomorphism p: L” —> L, using 3.4/9, we see 
in the same way that p(L”) c L”. It follows that o(L”) = L", as claimed. 


Next, let us discuss some consequences of the fundamental theorem of Galois 
theory. 


Corollary 8. Every finite separable field extension L/K admits only finitely 
many intermediate fields. 


Proof. Passing to a normal closure of L/K, see 3.5/7, we may assume that L/K 
is finite and Galois. Then the intermediate fields of L/K correspond bijectively 
to the subgroups of the finite group Gal(L/K). 


To be able to formulate our next result, we define for two subfields F, E’ of 
a field L their composite field E'. E’. It is the smallest subfield of D containing 
E as well as E’. Of course, we may view FE’: E” as being obtained by adjoining 
all elements of E’ to E, or likewise, by adjoining all elements of E to E’, i.e., 
EE = E(B") = E'(E). 


Corollary 9. Let L/K be a finite Galois extension. For intermediate fields E 
and E’ of L/K, consider H = Gal(L/E) and H' = Gal(L/E") as subgroups of 
G = Gal(L/K). Then: 
(i) ECE SHAD H". 
Gi) 2-2", 
(iii) EAE! = L”", where H" is the subgroup of G generated by H and H'. 


Proof. (i) If E c E”, then every E£’-automorphism of L is an E-automorphism 
as well, i.e., we have H = Gal(L/F) D Gal(L/E") = H’. On the other hand, we 
see that H > H’ implies EF = L® c LE’ = E’. 
(ii) Clearly, we have E- E’ C L#O#", as well as Gal(L/E- E’) C HOH’. 
From the latter inclusion we conclude that EE’ > L!”" with the help of (i). 
(iii) We have L?” = LEN L? = ENE’. 


Definition 10. A Galois extension L/K is called abelian (resp. cyclic) if the 
Galois group Gal(L/K) is abelian (resp. cyclic). 


Examples of cyclic, and hence abelian, Galois extensions are easy to obtain, 
since we can read from 3.8/4 and 3.8/6 that every extension between finite fields 
is cyclic. 
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Corollary 11. Let L/K be a finite abelian (resp. cyclic) Galois extension. Then, 
for every intermediate field E of L/K, the extension E/K is a finite abelian 
(resp. cyclic) Galois extension. 


Proof. In each case, Gal(L/£) is a normal subgroup in Gal(Z/K‘), since cyclic 
groups are abelian. It follows that the extension E'/K is Galois. Furthermore, 
the Galois group Gal(£/K) = Gal(L/K)/Gal(L/E) is abelian, resp. cyclic, if 
the group Gal(L/K) is of this type. 


Proposition 12. Let L/K be a field extension together with intermediate fields 
E and E' such that E/K and E"/K are finite Galois extensions. Then: 
(i) E- E’ is finite and Galois over K, and the homomorphism 


y: Gal(E- E'/E) — Gal(E'/EN E’, 
o->olp, 


is an isomorphism. 
(ii) The homomorphism 


w: Gal(E- B'/K) —s Gal(B/K) x Gal(E’/K), 
ot— (ol|n, lz), 


is injective. In addition, if EN E’ = K, then w is surjective and hence an 
isomorphism. 


Proof. We start with assertion (i). First, using the fact that EF. E’ = K(E, E’), 
we see that E’- E’ is normal, separable, and finite over K, since F/K and E’/K 
admit these properties. To show that y is injective, observe that o|~ = id for 
every o € Gal(E’- E’/E). In addition, for o € ker y we conclude that o|g = id 
and hence that o is trivial on E- E’. To derive the surjectivity of y we apply 
the fundamental theorem of Galois theory and consider the equation 


(ae _ (E . pene) NE =EN E’, 


which implies im y = Gal(E"/EN E’), as desired. 

The injectivity of w in assertion (ii) is easy to obtain. Just observe that 
every K-automorphism o € kerw is trivial on EF and on E’, therefore also on 
E.- E’. Concerning the surjectivity of ~, assume EM EE’ = K and consider an 
element (0, 0’) € Gal(E/F) x Gal(E’/K). By (i) we can extend o’ € Gal(E’/K) 
to an automorphism 0’ € Gal(E’- E’/F’) such that o’|~ = id. Likewise, we can 
extend o to an automorphism o € Gal(E£- E’/K) such that o|g = id. Then 
a 00’ is a preimage of (a,0’) with respect to w, since 


(F06')|p =6|n06'|np=o 
and 


(6 06")|p = 6| p06" |p =a’. 


4.1 Galois Extensions 141 


Exercises 


1. 


10. 


11. 


What sort of information can be deduced from the fundamental theorem of Galois 
theory for finite algebraic field extensions? 


. Indicate how the fundamental theorem of Galois theory could be extended to the 


context of finite quasi-Galois extensions. 


. Show that an algebraic field extension L/K is Galois if and only if K equals the 


fixed field of the automorphism group Aut (L). 


. Construct a field L together with a subgroup G C Aut(L) such that L/L is not 


a Galois extension. 


. Let L/K be a finite Galois extension and H C Gal(L/K) a subgroup. 


(i) Consider an element a € L such that the equation o(a@) = a for an auto- 
morphism ¢ € Gal(L/K) is equivalent to ¢ € H. Show that L” = K(a). 


(ii) Justify that associated to H, there is always an element a € L as in (i). 


. Let K bea field, f € KX] an irreducible separable polynomial, and L a splitting 


field of f over K, so that L/K is a finite Galois extension. If L/K is abelian, 
show that L = K(a) for every zero a € L of f. 


. Consider an algebraically closed field L and an automorphism o € Aut(L). Let 


K = CL’ be the fixed field under o. Show that every finite field extension of K is 
a cyclic Galois extension. 


. For a Galois extension L/K, consider an element a € L— K as well as an 


intermediate field K’ that is maximal with respect to the condition that a ¢ K’. 
Show for every intermediate field E of L/K’' satisfying [E : K'] < oo that E/K’' 
is a cyclic Galois extension. 


. Let K bea field and K an algebraic closure. Show: 


(i) If B;, i € I, is a family of intermediate fields of K/K such that E;/K is an 
abelian Galois extension for every i, then K(Uj;-7 Ei) is an abelian Galois 
extension of It as well. 

(ii) There exists a maximal abelian Galois extension K,,/K. It is characterized 
by the following properties: (a) K,/K is an abelian Galois extension. (b) 
For any further abelian Galois extension L/K, the field L is isomorphic over 
K to an intermediate field of Kap/K. 

(iii) Any two maximal abelian Galois extensions of the type discussed before are 
isomorphic over K. 


For a finite Galois extension L/K, consider intermediate fields L, Lz correspond- 
ing to subgroups Hy, H2 C Gal(L/K). Show for an automorphism o € Gal(L/K) 
that o(L1) = Le is equivalent to the equation oH,o~! = Ho. 


Show that L = Q(,./pr,.-.,./Pn) for distinct prime numbers p),...,pn is an 
abelian Galois extension of Q with Galois group (Z/2Z)”". Hint: Observe for 
a € Q with Va € L, and for o € Gal(Z/Q), that o(\/a) = +Va. From a 
more general point of view, this is an example of Kummer theory, to be dealt 
with in 4.9. Considering the multiplicative subgroup M C Q* that is generated 
by pi,...,Pn, the quotient M/M? can be viewed as a subgroup of the group 
Hom(Gal(L/Q), Z/2Z) of all group homomorphisms from Gal(L/Q) to Z/2Z. 
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4.2 Profinite Galois Groups* 


In the preceding section we considered Galois theory mainly for finite field 
extensions. We want to remove this restriction now and study some addi- 
tional phenomena that occur when one is working with nonfinite Galois ex- 
tensions. Given an arbitrary Galois extension L/K, we can look at the system 
£ = (Li)ier of all intermediate fields of L/K that are finite and Galois over 
K. Let fj: Gal(L/K) —> Gal(L;/K) be the restriction homomorphism ac- 
cording to 4.1/2. Then every o € Gal(Z/K’) determines a family of Galois 
automorphisms (0;);er, where 0; = o|,, = fi(o) and o;|z, = 0; if L; C L;. Con- 
versely, every family (0;)ier € [],¢7 Gal(L;/K) satisfying the compatibility re- 
lation o;|_, = 0; for L; C L; gives rise to a well-defined element o € Gal(L/Kx). 
Two facts are responsible for this. First, observe that D is the union of all fields 
L; € &, since for every a € L, the normal closure of K(a) in L/K is a finite 
Galois extension containing a; cf. 3.5/7. In particular, every 0 € Gal(L/K) is 
uniquely determined by its restrictions to the L;. On the other hand, for every 
two Galois extensions L;, Ll; € £, there is some L, € & such that L;UL; C Lp, 
namely the composite field L;-L; = K(L;,L,;). Thus, if (o;) is a family of Ga- 
lois automorphisms satisfying o;|_, = 0; for L; C L;, then the o; give rise to a 
well-defined map a0: L —> L. The latter is a K-automorphism, since for any 
two elements a,b € L, say a € Lj, b € Lj, there is always an index k such that 
a,b € Ly, and since we can use the fact that o, is a K-automorphism. 

Next consider a subgroup H C Gal(L/K). Similarly as before, we can re- 
strict H to each L; and thereby look at the subgroups H; = f;(H) C Gal(L;/K), 
i € J. An element a € L is invariant under H if and only if it is invariant under 
a subgroup (or alternatively, all subgroups) H; such that a € L;. However, in 
contrast to the situation encountered before, H will in general not be uniquely 
characterized by its restrictions H;. As an example, one may consider the ab- 
solute Galois group of a finite field, which will be computed at the end of the 
present section. This indeterminacy of H is the actual reason for the fact that 
the fundamental theorem of Galois theory 4.1/6 cannot be extended to infinite 
Galois extensions without modifying its assertion. It is necessary to consider a 
certain closure of subgroups in Gal(L/4), and the easiest way to do this is by 
using concepts from topology. 

Recall that a topology on a set X consists of a system T = (U;);<7 of subsets 
of X, the so-called open sets, such that the following conditions are satisfied: 

(i) @, X are open. 

(ii) The union of arbitrarily many open subsets of X is open. 

(iii) The intersection of finitely many open subsets of X is open. 

The pair (X,&) (in most cases simply denoted by X) is called a topological 
space. For a point x € X, open sets U C X containing x are called open 
neighborhoods of x. Complements of open subsets of X are referred to as closed 
subsets of X. Furthermore, given an arbitrary subset S C X, we can consider its 
closure S. The latter equals the intersection of all closed subsets of X containing 
S. In other words, it is the smallest closed subset of X containing S and hence 
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consists of all points « € X such that UMS # § for every open neighborhood 
U of x. As usual, a map of topological spaces (X’,T’) —> (X,T) is called 
continuous if the preimage of every T-open subset of X is T-open in X’, or 
equivalently, if the preimage of every T-closed subset of X is T/-closed in X’. 

To introduce a topology on a set X, we can start out from an arbitrary 
system %8 of subsets of X and look at the topology generated by it. To set up 
the latter, we enlarge 8 to a system 8’ by adding the special subset X C X, as 
well as all finite intersections of subsets of X that belong to 8. Then a subset 
U Cc X is said to be open if it is a union of sets belonging to %’, or in other 
words, if for every x € U there is some V € 8’ such that x € V C U. It is easily 
seen that in this way, we obtain a topology T on X. One calls T the topology 
on X that is generated by 8. Note that { is the weakest topology on X such 
that the members of % are open in X, i.e., every further topology { with this 
property is finer than & in the sense that every T-open subset of X is T’-open 
as well. Moreover, it is easily checked that the enlargement from 8 to %’ is 
unnecessary if X is the union of all members of % and if the intersection of any 
two sets U,V € % is a union of subsets of X that belong to B. 


As an example of the preceding construction, we can define the product of 
a family of topological spaces (X;)je7. To do this, consider on the Cartesian 
product of sets [],-;X; the topology generated by all subsets of type [],<; Vi, 
where U; is open in X; and U; = X; for almost all z € J. This is the weakest 
topology on [],<; Xi such that all projections onto the factors X; are continuous. 
Also we want to introduce the restriction of a topology on a set X to a subset 
V c X. Thereby we mean the topology on V whose open sets consist of all 
intersections of open sets in X with V. This topology is also referred to as the 
topology that is induced from X on V. 


We now return to a Galois extension L/K as considered before, and look 
again at the system £ = (L,)je; of all its subextensions that are finite and 
Galois over K, together with the corresponding restriction homomorphisms 
fi: Gal(L/K) —+ Gal(L;/K). We equip the finite group Gal(L;/K) for ev- 
ery i € I with the discrete topology, i.e., with the topology in which every 
subset of Gal(L;/A) is open. Moreover, we consider on Gal(L/k‘) the weakest 
topology such that all restrictions f;: Gal(L/K) —> Gal(L;/K) are continu- 
ous. Since Gal(L;/K) carries the discrete topology for every 7, the latter equals 
the topology that is generated by all fibers of the maps f;.4 


Remark 1. (i) A subset U C Gal(L/K) is open if and only if for every 0 © U 
there is an index i € I such that f,'(filo)) CU. 


(ii) A subset A C Gal(L/K) is closed if and only if for every o € Gal(L/K) 
not belonging to A there is an index i € I such that f,'(fi(o))N A =O. 

(iii) For a subset S C Gal(L/K) its closure S consists of all o € Gal(L/K) 
such that f>'(filo)) 1S £0 for alli € I. 


| The fibers of a map f: X —> Y are given by the preimages f~'(y) of points y € Y. 
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Proof. We restrict ourselves to proving (i), since the remaining assertions are 
formal consequences of it. Let 8 be the system of all fibers of the restriction 
maps f;, 7 € I. Due to the description of the topology generated by a sys- 
tem of subsets of a set X, we have only to show that it is not necessary to 
enlarge SB to a system %8’, as explained before, by adding finite intersections 
of members of %, i.e., that for any two automorphisms o; € Gal(L;/K) and 
a; € Gal(L,;/K), the intersection f7'(0;)N fy (o;) is a union of certain fibers 
of restriction maps f,: Gal(L/K) —>+ Gal(L;,/K). To achieve this, choose an 
index k € I such that L;U L; C Ly. Since f; is the composition of fj, with the 
restriction map Gal(L;/K) —+ Gal(L;/K), we see that f;'(c;) is the union of 
fibers of f,: Gal(L/K) —> Gal(L;/K). The same is true for £3), and it 
follows that also f7*(a;)N f;°(0;) is a union of fibers of f;,. 


Using Remark 1, we can easily see that Gal(L/K) is a topological group. By 
this one understands a group G equipped with a topology such that the group 
law Gx G —> G and the map of taking inverses G —> G are continuous, where, 
of course, G x G is considered as a product in the sense of topological spaces. To 
further illustrate the topology on Gal(L/J‘) let us prove the following assertion: 


Remark 2. The topological group Gal(L/K) is compact and totally discon- 
nected. 


Prior to giving the proof, let us recall that a topological space X is called 
quasicompact if every open covering of X contains a finite subcovering. Further- 
more, X is called compact if X is quasicompact and Hausdorff, which means 
that for x,y € X there exist disjoint open subsets U,V C X such that x € U, 
y € V. Finally, X is called totally disconnected if for every subset A C X con- 
taining at least two points, there exist two open subsets U,V C X such that 
ACUUV aswellasUNAZADAVNA and UNANV =9. For example, if 
X carries the discrete topology, then X is Hausdorff and totally disconnected. 
If, in addition, X is finite, then it is compact as well. 


Proof of Remark 2. The restriction maps f;: Gal(L/K) —> Gal(L;/K) 


induce an injective homomorphism 


Gal(L/K) > || Gal(Li/K), 


ier 


which we will view as an inclusion. Furthermore, [] Gal(L;/K’), as a product of 
finite discrete and hence compact topological spaces, is itself compact, due to 
Tychonoff’s theorem (a fact that in the present situation can also be justified by 
an elementary argument). Since [] Gal(Z;/A’) induces on Gal(L/K) the given 
topology, the latter is seen to be compact if we can show that Gal(L/K) is 
closed in J] Gal(Z;/K). To achieve this, consider a point (0;) € [] Gal(L;/K) 
that does not belong to Gal(L/K), i.e., such that there are two indices 7, j’ € I 
such that L; C Lj, but oy|_, 4 oj. Then the set of all (o/) € J] Gal(L;/K) 
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satisfying oj = 0; and oj, = oj forms an open neighborhood of (0;) that is 
disjoint from Gal(L/K). This shows that Gal(L/) is closed in [] Gal(L;/K). 

To show that Gal(L/K) is totally disconnected, it is enough to show that 
[[ Gal(Z;/K), as a product of discrete topological groups, is totally discon- 
nected. Let (o;) and (a/) be two different elements of [] Gal(Z;/A’). Then there 
exists an index 7 € J such that oj; # oj, and we can define open subsets 


V =[] Vi and V’ = J] VY in [] Gal(L;/K) by 


V= Gal(L;/K) fori 4 J, Vig Gal(L;/K) for iF j, 

“| {a5} for i = j, | Gal(L;/K)—{o;} for i=j. 
Hence, we get (a;) € V, (a4) € V’, as well as [[ Gal(Z;/K) = V UV’ and 
VV’ =9. From this we can read immediately that [] Gal(L;/K) satisfies the 
defining condition of a totally disconnected topological space. 


Now we are able to generalize the fundamental theorem of Galois the- 
ory 4.1/6 to arbitrary Galois extensions. 


Proposition 3. Let L/K be a (not necessarily finite) Galois extension. Then 
the intermediate fields of L/K correspond bijectively to the closed subgroups 
of Gal(L/K). More precisely, the assertions of the fundamental theorem 4.1/6 
remain valid if we restrict ourselves to those subgroups H C Gal(L/K) that are 
closed. 


The main work for proving the proposition was already done in Section 4.1; 
see 4.1/7. It remains only to verify for intermediate fields FE of L/K that the 
corresponding Galois group Gal(Z/E) is a closed subgroup of Gal(L/k), and 
that the composition Yo @ from 4.1/6 yields the identity on the set of all closed 
subgroups in Gal(L/k). Both facts are consequences of the following result: 


Lemma 4. Let H C Gal(L/K) be a subgroup and let L" C L be the correspond- 
ing fixed field. Then Gal(L/L"), viewed as a subgroup of Gal(L/K), equals the 
closure of H in Gal(L/K). 


Proof. As before, we consider the system (L;)ie7 of all intermediate fields of 
L/K such that L;/K is a finite Galois extension, together with the restriction 
maps f;: Gal(L/K) —> Gal(L;/K). Let H; = f;(H). Since an element a € L; 
is invariant under 4 if and only if it is invariant under H;, we get L? OL; = i 
and hence L” = ,.; Li", Now consider a second subgroup H’ C Gal(L/K) 
and set H/ = f,(H’). Then, by 4.1/4 or 4.1/6, we see that we get L? = L”" if 
and only if H; = Hj for all i € I. However, H’' := (),-, f, '(Hi) is clearly the 
largest subgroup in Gal(Z//‘) such that f;(H’) = H; for all i € I and hence 
such that L”’ = L#. Thereby we see that H’ = Gal(L/L"). 

On the other hand, the closure H of H in Gal(L/K) is computed according 
to Remark 1 (iii) as follows: 
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H = {o € Gal(L/K) ; f;"(fi(o)) 0H £0 for alt i € T} 


7 


= {o € Gal(L/K) ; f,() € H; for allie rh 


Thus, Gal(Z/L") is recognized as the closure of the subgroup H C Gal(L/K). 


In the setting of Proposition 3, the open subgroups of Gal(L/K) can be 
characterized as follows: 


Corollary 5. Let L/K be a Galois extension and H a subgroup of Gal(L/K). 
Then the following assertions are equivalent: 

(i) H is open in Gal(L/K). 

(ii) H is closed in Gal(L/K) and the fixed field L" is finite over K. 


Proof. First assume that H is open in Gal(ZL//’). Then H is closed in Gal(L/K) 
as well, since all its left (resp. right) cosets in Gal(L/K) are open, and hence the 
complement of H is open in Gal(Z/’). Furthermore, using Remark 1 (i), there 
exists a finite Galois extension L’/K in L such that H contains the kernel of 
the restriction map Gal(L/K) —> Gal(L’'/K), which equals Gal(L/L’). Then, 
using Proposition 3, we get L? c LO!Z/") = L’, and L# is finite over K, since 
the same is true for L’. 

Conversely, if H is closed in Gal(L/K) and L"”/K is finite, we can con- 
sider the normal closure L’ c L of L"/K, which is finite over K; cf. 3.5/7. 
Then Gal(Z/L’) is open in Gal(Z/K’) according to Remark 1 (i), and we have 
Gal(L/L') Cc Gal(L/L") = H, again by Proposition 3. In particular, H is open 
in Gal(L/K). 


In studying infinite Galois extensions L/K, it is often convenient to view 
the Galois group Gal(Z/K) as the projective limit of the finite Galois groups 
Gal(L;/K), where (L;)ier, as usual, denotes the system of all intermediate fields 
of L/K that are finite and Galois over kK’. Let us briefly explain the formalism 
of projective limits. 

Consider a partially ordered index set J with order relation < as in 3.4, 
together with group homomorphisms f;;: G; —> G; for indices 1,7 € I, i < j, 
and assume the following conditions: 

(i) fig = idg, for alli € J. 

(ii) Sik = fi ° fik for 2 < j < Kh 

Such a system (Gi, fij)ijer is called a projective system of groups. In a sim- 
ilar way one can define projective systems of sets or of sets with additional 
structures. For example, for a projective system of topological groups it is re- 
quired that all maps fj; be continuous homomorphisms. A group G' together 
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with homomorphisms f;: G —> G; satisfying fi; = fi; 0 fj for 7 < 7 is called 
a projective limit of the system (Gi, f;;) if it admits the following universal 
property: 

If hy: H —+ Gi, 1 € I, are group homomorphisms satisfying hi = fiz 0 hj 
for i < j, then there exists a unique group homomorphism h: H —+ G such 
that h; = fioh for allie I. 

This condition is illustrated by the following commutative diagram: 


If a projective limit G exists, it is unique up to canonical isomorphism, as is the 
case for any object defined in terms of a universal property. The reason is as 
follows. If in the above situation, together with (G, f;), also (H, h;) is a projective 
limit of (Gi, fi;), then besides h: H —+ G, there is also a homomorphism 
g: G — H satisfying the compatibilities as specified in the above diagram. 
Taking into account the uniqueness condition in the definition of projective 
limits, we see that the maps goh,idy,: H —> H coincide, and that the same is 
true for ho g,idg: G — G. Thus, h/ and g are inverse to each other. We write 
G= lim, G;, for the projective limit of the system (Gj, fi;), where in most 
cases, the homomorphisms f; are not mentioned explicitly, certainly if they are 
defined in an obvious way. 

If (Gi, fij) is a projective system of topological groups and (G, f;) is its 
projective limit in the sense of ordinary groups, then we can equip G with 
the weakest topology such that all homomorphisms f; are continuous. This is 
the topology generated by all preimages f; '(U) of open subsets U C Gj; it is 
referred to as the projective limit of the topologies on the groups G;. Indeed, 
G equipped with this topology is a projective limit of (Gi, fi;) in the sense of 
topological groups. 

Let us add along the way that there is also a notion that is dual to that 
of projective limits, namely the notion of inductive (or direct) limits lim. The 
definition of an inductive system, resp. of an inductive limit, is obtained by 
inverting the direction of all arrows occurring in the corresponding setup of a 
projective system, resp. of a projective limit. In addition, for inductive systems 
it is required that the inherent index set J be directed in the sense that for 
arbitrary indices i,7 € J, there is always an index k € I such that i,7 < k. 
Projective and inductive limits of groups (resp. sets, or rings, etc.) exist always, 
as is easily verified. However, here we are interested only in the projective case: 


Remark 6. Let (Gi, fi;) be a projective system of groups. 
(i) The subgroup 
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G => { (xs )ier 3 fi (xj) = Xj for 2 < j} Cc [[4: 


iel 


together with the group homomorphisms f;: G —> G; that are induced from the 
projections of [];-; G; onto its factors, constitutes a projective limit of (Gj, fi;). 
In particular, every system (ai)ier € []je,Gi satisfying fij(aj;) = 24 for 
i<j determines a unique element x € lim Gj. 
(ii) If (Gi, fij) ts @ projective system of topological groups and G is as in (i), 
then the restriction of the product topology on [],<; Gi to G equals the projective 
limit of the topologies on the G;. 


Our main example of projective systems is inspired from Galois extensions. 
Indeed, let L/K be a Galois extension and let £ = (L;)ier be the system of all 
intermediate fields that are finite and Galois over K. We introduce a partial 
ordering on I by setting 7 < 7 if L; C L;. Furthermore, write G; = Gal(L;/K) 
for i € I and let fi;: Gal(L;/K) —> Gal(L;/K) for i < j be the restriction 
map. Then (Gj, fi;) is a projective system of groups, resp. (discrete) topological 
groups, and we can prove the following result: 


Proposition 7. The restriction maps fi: Gal(L/K) —> Gal(L;/K) define 
Gal(L/K) as the projective limit of the system (Gal(L;/IX), fi), ¢-e., 


Gal(L/K) = lim Gal(L;/K). 


il 
This holds in terms of ordinary groups, but also in terms of topological groups. 


Proof. It is enough to check the defining universal property of a projective 
limit in terms of ordinary groups, since the topology given on Gal(L/I) coin- 
cides by its definition with the projective limit of the topologies on the groups 
Gal(L;/K). Therefore, consider group homomorphisms h;: H —+ Gal(L;/K) 
that are compatible with the restriction maps f;;. To verify the unique- 
ness part of the universal property, assume there is a group homomorphism 
h: H —+ Gal(L/K) satisfying h; = f; 0h for alli € I. Fixing an element 
x € H, write o = h(a), as well as o; = h(x). Then the relation h; = f; oh 
implies o; = o|z,. Since L equals the union of all L;, it follows that o = h(x) 
is uniquely determined by the elements o; = h;(a). On the other hand, we can 
use this observation to settle the existence part of the universal property and 
to construct a homomorphism h: H —+ Gal(L/K) as desired. Indeed, look at 
an element x € H and let o; = h;(x), 1 € I, denote the images of 7. Then the 
relations hj = f,; 0h; for i < j, and hence L; C L;, show that o; = o;|1,. Now 
use L = U,-, Li and the fact that J is directed, which means for i,j € J that 
there is always an index k € I satisfying i,j < k, ie., L; UL; C Ly. Then we 
can conclude that the o; determine a well-defined automorphism o € Gal(L/K) 
restricting to 0; on each L;. Mapping x € H in each case to the corresponding 
element 0 € Gal(L/K), we obtain a group homomorphism h: H —> Gal(L/K), 
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as desired. Thus, all in all, Gal(Z/K) admits the defining universal property of 
a projective limit of the system (Gal(L;/K))jer, and we are done. 


In view of Proposition 7, one calls Gal(L/K) a profinite group, thereby 
indicating that it is a projective limit of finite (discrete) groups. Let us add 
for a projective system (Gi, fi;)ijer with a directed index set I that in order to 
determine the corresponding projective limit, it is enough to execute this limit 
over a cofinal subsystem. Here a subsystem (Gi, fij)i jer of (Gi, fij)ijer is called 
cofinal if for every index i € J, there exists an index 2’ € J’ such that i < 2’. For 
example, if (Z;)icr is a subsystem of the system (L,;)jer of all intermediate fields 
of L/K that are finite and Galois over K,, and if for every i € I there is some 
i’ € I’ such that L; C Ly, then Gal(L/K) is already the projective limit of the 
Galois groups Gal(L;/K), 7 € I’. Note that the index set J is directed in this 
case, since for 7,7 € J there is always an index k € J such that L;U L; C Lx. 

Finally, let us have a look at an example in which we compute an infinite 
Galois group. Let p be a prime number and F an algebraic closure of the field 
F,, with p elements. Then every finite extension of F, is of type F, for a power 
q =p", see 3.8/2, and we may view all such fields F, as subfields of F; see 3.4/9 
and 3.8/3. We want to compute the Galois group Gal(F/F,) for a fixed power 
q = p", the so-called absolute Galois group of F,. To do this, we consider the 
system of all finite Galois extensions of F,, hence by 3.8/3 and 3.8/4 the system 
(Fi )ien—{o}- Then Proposition 7 says that 

Gal(F/F,) = lim Gal(Fy:/F,). 
i€N—{0} 

Let us look more closely at the projective limit on the right-hand side. We 
write ¢: F —> F, at > (a”)" = a’, for the nth power of the Frobenius homo- 
morphism on F; similarly as in Section 3.8, one calls o the relative Frobenius 
homomorphism over Fy. Recall that Fy C F is the splitting field of the polyno- 
mial X47 —X over F,,; see 3.8/2. Hence, the fixed field under the cyclic subgroup 
of Gal(F/F,) that is generated by o equals F,. Furthermore, let us write o; for 
the restriction of o to the finite extension Fj: of F,. Then we can read from 
3.8/3, resp. 3.8/6, the following: 


qd 


Remark 8. (i) The Galois group Gal(Fji/F,) is cyclic of order i, generated by 
the restriction o; of the relative Frobenius homomorphism over Fg. 

(ii) We have Fyi C Fy if and only if i divides j. If this is the case, the 
restriction homomorphism Gal(F,i/F,) —> Gal(F,:/F,) maps the generating 
element ao; to the generating element o;. 


Thereby we see that in order to determine the limit lim Gal(F;: /F,), we 
have to execute the projective limit over the system (Z/?Z)icn—so}. In more 
detail, the order relation on N—{0} is the divisibility relation, while for i | J 
the corresponding homomorphism f;;: Z/jZ —> Z/iZ is the one mapping the 
residue class I € Z/jZ to the residue class T € Z/iZ. Thus, we obtain the 
following result: 
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Proposition 9. There exists a unique isomorphism of topological groups 


Gal(F/F,) ~ lim Z/iZ, 


iEN—{0} 


such that the relative Frobenius homomorphism o € Gal(F/F,) corresponds to 
the system of residue classes 1 € Z/iZ, i € N—{O}. 


We write Z = him (0} Z/iZ, where the limit may just as well be viewed as 


a projective limit of rings, or topological rings.” In particular, we observe that Z, 
up to canonical isomorphism, is the absolute Galois group of every finite field. 
Moreover, Z is canonically a subgroup of Z, since the projections Z —> Z/iZ 
give rise to an injective homomorphism Z —> Z. In fact, Z corresponds to the 
free cyclic subgroup (c) in Gal(F/F,) that is generated by the relative Frobe- 
nius homomorphism over F,. Since all projections Z —> Z/iZ are surjective, 
we conclude that Z lies dense in Z, so that o generates a dense subgroup in 
Gal(F/F,), ie., a subgroup whose closure equals Gal(F/F,). By the way, this 
fact is also a consequence of Lemma 4, since F, can be interpreted as the fixed 


field F‘”. We will see below that Z is a proper subgroup of Z, even a subgroup 
that is significantly “smaller” than Z itself. In particular, it follows that the 
relative Frobenius homomorphism o generates a subgroup in Gal(F/F,) that is 
not closed. rm 

We may view Z, as the notation indicates already, as a certain closure of the 
ring Z, although other closures of Z are thinkable as well. For example, when 
executing the projective limit of the quotients Z/iZ, we may restrict ourselves 
to integers i varying only over a certain subset of N—{0}. Indeed, for a prime 
number ¢, the projective limit of topological rings 


Ly = lim Z /O7Z 
ven 
is referred to as the ring of integral ¢-adic numbers. In our situation these rings 


are quite useful, since their structure is easy to describe and since, on the other 
hand, Z decomposes into a Cartesian product of them: 


Proposition 10. There exists a canonical isomorphism of topological rings 


Z= lim Z/iZ~ [| Z 


iEN—{0} £ prime 


Proof. We show that P := [], prime Z¢, together with canonical homomorphisms 
fi: P — Z/iZ to be introduced below, admits the universal property of a 
projective limit of the system (Z/iZ)jcen—so;. To do this, look at an integer 


? For a topological ring R it is required that R be a topological group with respect to the 
addition, and furthermore, that the ring multiplication be continuous. 
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i € N—{0} with prime factorization i = [], @’, where, of course, almost all 
exponents 1,(2) are zero. Applying the Chinese remainder theorem in the version 
of 2.4/14 yields that the canonical homomorphism 


(x) Z/iZ— [I 2/ez 


£ prime 


is an isomorphism, and we obtain a canonical homomorphism 


fir P— [2/7 > 2/2. 


£ prime 


Varying i over N-{0}, the homomorphisms f;: P —> Z/iZ are compatible with 
the projections fi,;: Z/j7Z —+ Z/iZ for i| 7. Furthermore, the definition of the 
f, shows that the topology on P coincides with the weakest one such that all f; 
are continuous. Therefore, it remains only to check that (P, f;) is a projective 
limit of (Z/iZ, f,;), in the sense of ordinary rings. 

However, this is more or less straightforward. Fix a ring R and consider ring 
homomorphisms h;: R —> Z/iZ for 1 € N—{0} that are compatible with the 
projection maps f;;. Relying on isomorphisms as in (*), the h; induce for every 
prime number £ homomorphisms h,.: R — Z/ (OZ, that are compatible with 
the projection homomorphisms of the projective system (Z/€’Z),cx. Therefore, 
the h,¢ define a rmg homomorphism he: R —> im " Z/t’Z and hence, letting 
é vary, a ring homomorphism h: R —+ P, which satisfies h; = f;, 0 h for all 2. 
Finally, since the h;, are equivalent to the h;, we can use the uniqueness part 
of the universal property of the limits Z, to show that h is uniquely determined 
by the h,. 


Thus, we can summarize: 


Theorem 11. Let F be a finite field and F an algebraic closure. Then there 
exists a canonical isomorphism of topological groups 


Gal(F/F) ~ II Le 


é prime 


such that the relative Frobenius homomorphism o € Gal(F/F) corresponds to 
the element (1,1,...) € TI, prime Z¢. Here 1 denotes in each case the unit element 
in Ze, which is viewed as a ring. 


In particular, we thereby see that the free cyclic subgroup Z C Gal(F/F) 
that is generated by the relative Frobenius homomorphism o is significantly 
“smaller” than the Galois group Gal(F/F), in fact, even significantly “smaller” 
than the ring of integral f-adic numbers Z,. Indeed, it is not too hard to see 
(and this justifies the terminology of ¢-adic numbers) that the elements of Z, 
correspond bijectively to all formal infinite series )>%° ) c,¢” with integer coef- 
ficients c,, 0 < c < €—1. In this setting, the subset N of natural numbers is 
represented by all finite sums of this type. 
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Exercises 


i 


10. 


Make precise the basic idea that makes it possible to extend the fundamental theo- 
rem of Galois theory 4.1/6 in a straightforward way to infinite Galois extensions. 


Explain why infinite Galois groups should be viewed rather as topological or profi- 
nite groups than as purely abstract groups. 


Let X be a set and (X;)ier7 a system of subsets of X. For indices i,j € I such 
that X; C Xj, let fi; be the inclusion map X; —> Xj. 

(i) Write i < 7 if X; C X; and show that (Xj, fij) is a projective system of sets 
satisfying lim,_, Xi = Nicer Xi- 

(ii) Write i < j if X; C Xj; and assume that the index set J is directed with 
respect to the relation <. (However, in the present context, the condition 
“directed” is without significance.) Show that (Xj, fj;) is an inductive sys- 
tem of sets satisfying lim, _ Xi = User Xi- 


Show that every inductive system of groups admits an (inductive) limit. 


Let K be a field and K an algebraic closure of K. Show that the absolute Galois 
group Gal(K/ K) is independent of the choice of K, up to isomorphism. 


Let L/K be a field extension and let (L;)icr be a system of intermediate fields 
such that L;, in each case, is Galois over K and such that for i,j € J there 
is always an index k € I satisfying L; UL; C Lx. Furthermore, let L’ be the 
smallest subfield of L containing all L;. Show that L’/K is Galois and that we 
have Gal(L//K) = lim Gal(L;/I) in the sense of topological groups. 
Let L/K be a Galois extension and F an intermediate field such that E/K is 
Galois. Show: 
(i) The restriction homomorphism y: Gal(L/K) —> Gal(£/K) is continuous. 
(ii) The topology on Gal(E/K) equals the quotient topology with respect to 
Y, ie., a subset V C Gal(E/K) is open if and only if y~1(V) is open in 
Gal(L/K). 
Can there exist a Galois extension L/K satisfying Gal(L/K) ~ Z? 
Look at the situation of Theorem 11. 


(i) For a prime number £, determine the fixed field of Zp, viewed as a subgroup 
of Gal(F/F). 
(ii) Determine all intermediate fields of F/F. 


Consider the ring Zp = lim | Z/e’Z of integral ¢-adic numbers associated to a 
prime number @. For an element a € Zp, let v(a) be the maximum of all integers 
v € N such that the residue class of a in Z/€’Z is zero; set v(a) = oo for a = 0. 
Furthermore, define the ¢-adic absolute value of a by |alp = €-’®. Show for 
a,b € Zy: 

(i) jale =O a =0, 

(ii) |a- dle = lale- |ble, 

(iii) Ja + ble < max{]a|e, |blc}. 
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11. Show that the ¢-adic absolute value | - |¢ of Exercise 10 induces the topology of 
Ze (in the sense that a subset U C Ze is open if and only if there exists for every 
point of U an é-adic ¢-neighborhood that is contained in U). Furthermore, show 
that (1—2)-t = rar ¢’, where the convergence is naturally understood in terms 
of the adic absolute value. Using a similar argument, one can show that every 
element a € Z, satisfying |a|p = 1 is a unit in Zp. 


4.3 The Galois Group of an Equation 


Let K be a field and f € K[X] a nonconstant polynomial. Furthermore, let L 
be a splitting field of f over K. Then, if f is separable, L/K is a finite Galois 
extension, and Gal(L/K‘) is referred to as the Galois group of f over K or, more 
suggestively, as the Galois group of the equation f(x) = 0. 


Proposition 1. Let f € K[X] be a separable polynomial of degree n > 0 with 
splitting field L over K, and let ay,...,Qn € L be the zeros of f. Then 


y: Gal(L/K) — S({a1, as <n }) ~ Gy, 
ars | iii ates 


defines an injective group homomorphism from the Galois group of L/K to the 
group of permutations of ay,...,Qn, resp. to the group G, of permutations of 
n elements. In particular, Gal(L/K) can be viewed as a subgroup of G,,, and it 
follows that [L: K] = ordGal(L/K) divides ord G,, = nil. 

Furthermore, f is irreducible if and only if Gal(L/K) operates transitively 
on the zero set {a1,...,Qn}, te. if and only if for every two of these zeros 
a;,a;, there exists an automorphism a € Gal(L/K) such that o(a;) = a;. Such 
is always the case for [L: K] =nl, resp. Gal(L/K) ~ Gy. 


Proof. Consider an automorphism ¢ € Gal(L/K). Since o leaves the coefficients 
of f invariant, it maps zeros of f to zeros of f. Furthermore, o is injective and 
hence induces on {a ,...,@,} an injective and therefore bijective self-map, in 
other words, a permutation. This shows that the map y is well defined. It is 
injective as well, due to L = K(a,,...,@,), since every K-homomorphism of 
Gal(L/K) is uniquely determined by its values on the elements ay,...,Qp- 

Now assume that f is irreducible, and consider two zeros a;,a,; of f. Using 
3.4/8, there is a K-homomorphism o: K(a;) —> K(a;) such that o(a;) = aj. 
This extends by 3.4/9 to a K-homomorphism o/: L —> L, where L is an 
algebraic closure of LZ. Since the extension L/K is normal, it follows that o’ 
restricts to a K-automorphism of L and thereby to an element o” € Gal(L/K) 
satisfying o’(a;) = aj. 

On the other hand, if f is reducible, consider a factorization f = gh into 
nonconstant polynomials g,h € K[X]. Then every o € Gal(L/K) induces a 
self-map on the zeros of g and, in the same way, on the zeros of h. However, 
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since f is separable by our assumption, the zeros of g are disjoint from those of 
h, and we thereby see that o cannot act transitively on the zeros of f. 


Since every finite Galois extension L/K is simple by the primitive element 
theorem 3.6/12, it follows that L is a splitting field of a polynomial of degree 
n= ([L: K] in K[X]. Thus, we can conclude the following corollary: 


Corollary 2. For a finite Galois extension L/K of degree n, the Galois group 
Gal(L/K) may be viewed as a subgroup of the permutation group Gy. 


Also note in the setting of Proposition 1 that the Galois group Gal(L/K) 
will in general be a proper subgroup of G,. For example, if f € KX] is the 
minimal polynomial of a primitive element of L/K and n is its degree, we get 
ord(Gal(L/K)) = n < n! = ordG, for n > 2. Hence, as a general rule, not 
every permutation of the zeros of f as in Proposition 1 is induced from a Galois 
automorphism. 

Let us compute the Galois group of polynomials f € KX] in some special 
cases. 


(1) Consider a polynomial f = X*+aX +b € K[X], where we assume that 
f does not admit a zero in KX’. Then f is irreducible in K [X] and, furthermore, 
separable if char K 4 2 or a £ 0. Adjoining a zero a of f to K, the resulting 
field L = K(q) is a splitting field of f over K,, so that L/K is a Galois extension 
of degree 2. In particular, the Galois group Gal(L/K) is of order 2 and thus 
necessarily cyclic. 


(2) Next consider a polynomial of type f = X° + aX +b € K[X], where 
char kK # 2,3. Let us mention along the way that every monic polynomial 
X3+4¢X?+...€ K[X] of degree 3 can be assumed to be of the latter form, 
using the substitution X ++ X —c for c= $1; the splitting field and Galois 
group of f remain unchanged under such a transformation. We assume that f 
does not have a zero in Kk’. Then f is irreducible in K |X] and, by our assumption 
on char Kk’, also separable. Let L be a splitting field of f over K andae La 
zero of f. Then K(a)/K is an extension of degree 3, and the degree [L : K] 
may be 3 or 6, depending on whether or not K(q) is already a splitting field of 
f. Likewise, Gal(L/K) will be of order 3 or 6, where in either case, we can view 
this group as a subgroup of 63, according to Proposition 1. In the first case, 
Gal(L/K) is cyclic of order 3; every element o € Gal(L/K) different from the 
identity is a generating element, since orda > 1 and (orda)|3 imply orda = 3. 
In the second case we get Gal(L/K’) = Gs, since ord Gal(L/K) = 6 = ord G3. 

We want to explain a general principle allowing one to find out which of the 
two cases we are facing when considering special examples. Let a1, a2,a3 € L 
be the zeros of f and write 


0 = (a1 — a2)(a2 — a3)(a1 — a3). 


Then A = 6? is called the discriminant of the polynomial f; see also Section 4.4 
for this notion. Since A is invariant under the automorphisms in Gal(L/IX), we 
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get A © K, where in our special case, an easy calculation shows that 
A = —4a? — 270”. 


Applying an automorphism o € Gal(L/K) to 6, it is possible that the factors of 6 
change signs. Therefore, we get 7(0) = +0, depending on whether o corresponds 
to an even or an odd permutation in G3. (A permutation 7 € G,, is called even, 


resp. odd, if 
sgn(7) = II m™(t) im m5) 


=a 
i<j J 


the sign of 7, assumes the value 1, resp. —1; note that the function sgn is 
multiplicative in the sense that sgn(zo 7’) = sgn(7)-sen(z’) for 7, 7’ € G,,; see 
also 5.3.) 

The even permutations in G, form a subgroup, the so-called alternating 
group ,,. This group coincides with the kernel of the group homomorphism 


6, — {1,-]}, TH sgn(7), 


which is surjective for n > 1. Thus, 2, is a normal subgroup of index 2 in 6, 
for n > 1. In addition, we see that all permutations 7 € G, whose order is 
odd must belong to 2,,. In particular, 23 is the only subgroup in G3 of order 3. 
Hence, the following equivalences hold: 


ord Gal(L/K) = 3 
<=> Gal(L/K) C 63 contains only even permutations 
— dock 
<=> A admits a square root in K. 


Therefore, we can figure out whether Gal(Z/‘) is of order 3 or 6 by checking 
whether the discriminant A admits a square root in K or not. 

To look at an explicit example, let L be the splitting field of the polynomial 
f = X°?-X +1 € Q{[X]; it is irreducible, since f does not split off a linear 
factor in Z[X]. Then we get Gal(L/Q) = G3, since VA = /—23 ¢ Q. 


(3) To give another example, let us look at special irreducible polynomials 
of degree 4, more precisely, irreducible monic polynomials f € Q[X] that are 
biquadratic in the sense that their linear and cubic terms are trivial. Such 
polynomials are of type f = (X? — a)? — b, where we will assume b > a? in the 
following. For instance, the polynomials X*— 2 and X* — 4X? — 6 are of this 
type. The zeros of f in C are given by 


a=Vat+vo, —a, B= a— Vb, —B, 


where Vb > |a|. Therefore, a is real, in contrast to 8, which is a square root 
of a negative real number. Furthermore, the splitting field of f in C is given 
by L = Q(a, 8). To determine its degree over Q, observe that a is a zero of f 
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and hence has degree 4 over Q, which means that [Q(a) : Q] = 4. Further, 8, 
as a square root of a— Vb € Q(a), is of degree < 2 over Q(a). Since Q(a) is 
contained in R but is not, it follows that in fact, 6 is of degree 2 over Q(a) 
and that we can conclude that [ZL : Q] = [Q(a, 8) : Q] =8. 

Now let us determine the Galois group Gal(L/Q). To do this, we use 
Proposition 1 and view Gal(L/Q) as a subgroup of the permutation group 
S({a, —a, 8,—8}) of the zeros of f. In doing so, we know already that L/Q) 
of degree 8 and hence that Gal(L/Q) is of order 8. Furthermore, every 
a € Gal(Z/Q) is a field homomorphism and as such satisfies the relations 
—a) = —o(a), o(—8) = —o(8). However, there are precisely eight permuta- 
tions in S({a, —a, 6, —(}) satisfying these conditions. Indeed, if we want to set 
up such a permutation, there are precisely four possibilities to select a(a), where 
a(—a) is determined by o(—a) = —o(a). Then there remain two possibilities to 
select o(3), where again, o(—{) is determined by the relation o(—6) = —o({). 
As a result, there are precisely eight permutations in S({a,—a, 6, —(}) satis- 
fying the relations o(—a) = —o(a), o(—8) = —o(), and it follows that these 
must coincide with the elements of Gal(Z/Q). To explicitly describe this group, 
consider the two elements 0,7 € Gal(L/Q) that are given by 


Ke 
n 


Q 


a: ar>Q, Br>-a, 
T: Qe>-a, BE>B. 
The subgroup (a) C Gal(L/Q) generated by a is cyclic of order 4 and therefore 


normal in Gal(L/Q), since it is of index 2. Furthermore, 7 is of order 2. Since 
T € (co), we obtain 


Gal(L/Q) = (0,7) = (0) Ur(a) = (a) U (a) 7, 
or more explicitly, 
Gal(L/Q) = {l,¢, o*,0°, T, OT, 077, er}: 


To describe the group law on Gal(L/Q) it is enough to check that o and T satisfy 
the relation ta = 0°r. Now it is easy to specify all subgroups of Gal(Z/Q) by 
way of the following scheme: 


Gal(L/Q) 


an — 


{1.0757 or} 115%) oo} {1c eo, or} 


PS, | er | 


{1,7} {1,077} {1,07} {1,o7} {1,07} 


es a 


{1} 
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The subgroups of Gal(L/Q) are in one-to-one correspondence with the inter- 
mediate fields of L/Q, due to the fundamental theorem of Galois theory 4.1/6. 
These fields can be determined by considering suitable elements of degree 2 or 
4 in L that are invariant under the above groups. 

As a counterpart to the preceding class of biquadratic polynomials, we want 
to determine the Galois group of the polynomial f = X* — 4X? + 16 € Q[X]. 
Also in this case, f is of type (X?— a)? — 6, where, however, a = 2 and b = —12 
do not satisfy the above condition b > a?. The zeros of f in C are given by 


a= OT ai a -a, B = jer mc 


resp. 
ae, 20, 20, Be, 


where ¢ = e?""/!? has to be viewed as a square root of $ + 4/3, and likewise, 
e~?7i/12 as a square root of 4 — $iV/3. In particular, adjoining a zero of f to Q, 
say a, we see that L = Q(a) = Q(C) is a splitting field of f over Q. Thus, the 


Galois group of L/Q is of order 4, and its members are characterized by 


Gs Cie; 
a Cee, 
os: Cr 46%, 
oa: CR". 


From this we can read the relations 0; = id, 05 = 03 = of = id, as well 
as 02 0 03 = 04. Furthermore, we observe that Gal(L/Q) is commutative and 
hence that Gal(L/Q) ~ Z/2Z x Z/2Z. Besides the trivial subgroups, Gal(Z/Q) 
contains only the subgroups (02), (03), (04), where in terms of the fundamental 
theorem of Galois theory 4.1/6, these correspond to the intermediate fields 
Q(¢3), Q(C?), Q(V3); use the fact that /3 = ¢+¢!". Thus, up to the trivial 
intermediate fields Q and L, these are the only intermediate fields of L/Q. 
Extensions of type L/Q will be studied in more detail in Section 4.5. Indeed, L 
is constructed from Q by adjoining a so-called primitive 12th root of unity ¢, 
and it belongs to the class of cyclotomic fields. 


(4) As a last example, let us study the generic equation of degree n. To do 
this, we fix a field k and consider over it the field LZ of rational functions in 
finitely many variables 7),...,7;,, namely 


Dah Tipiihg dy OE Digs Tal): 


Every permutation 7 € G, defines an automorphism of L by applying 7 to the 
variables T,,..., 7): 
k(Th, cee Gh) — k(Ti, vers reese 


g(Th, si8 La) jo G(T; ers slate) ; 
ee Ce peers op 


158 4. Galois Theory 


The corresponding fixed field K = LS” is referred to as the field of symmetric 
rational functions in n variables with coefficients in k. As we read from 4.1/4, 
the extension L/K is Galois of degree n! and admits G, as Galois group. 

To specify the “equation” of the extension L/K, we choose a variable X 
and consider the polynomial 


F(X) =][(x-7) 


(—1) - 8;(Th,.--,Tr)-X" 7 ER[N,..., Tr] [X], 


j=0 


where s; is obtained by expanding the product of the factors X — JT; and by 
collecting coefficients of (—1)/X"; it is called the jth elementary symmet- 
ric polynomial (or the jth elementary symmetric function) in T,,...,T;,. More 
explicitly, the elementary symmetric polynomials are given by 


So = 1, 
§=7,+...4+T7), 
592 = 7T,To+ 7,73 +...+7,-1Th, 


$5 = Tics Las 


Viewing f as a polynomial in L[X], we see that it is invariant under the ac- 
tion of G, and hence admits coefficients already in K. Therefore, we have 
k(s1,..., Sn) C K, and it follows that L is a splitting field of f over k(s1,..., Sn), 
resp. A. Moreover, using deg f = n and [L : K] = nl!, we see from Proposition 1 
that f is irreducible in K[X]. 


Proposition 3. Every symmetric rational function in k(T,,...,Tn) can be 
uniquely written as a rational function over k in the elementary symmetric 
polynomials s,,..., Sn. In more precise terms: 

(i) (sq; 24.58, =e. 

(ii) $1,...,5, are algebraically independent over k. 


Proof. To justify (i), observe that 
[isk] =ordG, =n, 

and that k(s,,...,8,) C K. Therefore, it is enough to establish the estimate 
[L ad Sn)| <n. 


However, this is a consequence of Proposition 1, since L is a splitting field of 
f =T[(X — T,;) over k(s1,..., 5). 

To show that the elementary symmetric polynomials s;,...,5, are alge- 
braically independent over k, consider the field k(S,,...,5,) of all rational 
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functions in n variables $,...,S,, as well as a splitting field L of the polyno- 
mial 


n 


F(X) = So(-1)' 5; X"F € k(S1,..., 5n)[X], 


j=0 


where we put So = 1 by convention. Let t,,...,t, be the zeros of f in Tse allowing 
repetitions according to their possible multiplicities. Then we see that 


L=k(S1,...,Sn)(t1,...,tn) = k(t,..., tn), 


since the elements S),..., 5, may be written as elementary symmetric functions 
in t),...,t, and as such belong to k(t1,...,t,). Now the homomorphism 


kK[T,...,Tn] — klti,...,tn], So aT’ Sat’, 


maps elementary symmetric functions in 7),...,7;, to elementary symmetric 
functions in t,,...,t, and thus restricts to a homomorphism 


k[s1,---;8n] —> k[Si,..., Sn], So ays’ 5° ay 8”. 


Since S),...,S,, are variables, this map is necessarily injective and therefore an 
isomorphism. In particular, s;,..., 5, may be viewed as variables and hence are 
algebraically independent over k. 


The idea we have just employed, namely to make use of generic polynomi- 
als, i.e., of polynomials with variables as coefficients, leads immediately to the 
generic equation of degree n. Indeed, fixing variables S),...,5;,, the polynomial 


p(X) =X" 4+ X71 +...4+ Sy € k(S1,..-, $n) X] 


is referred to as the generic polynomial of degree n over k. The corresponding 
equation p(x) = 0 is called the generic equation of degree n. We want to deter- 
mine its Galois group by showing that we may identify p(X) up to isomorphism 
with the polynomial f(X) discussed above. 


Proposition 4. The generic polynomial p(X) € k(S1,...,Sn)[X] of degree n 
is separable and irreducible. It admits G,, as its Galois group. 


Proof. We consider the rational function field L = k(T\,...,T;,) in n variables 
T,,...,T7), over k, as well as the fixed field 


Ka L?* S bl gy :. 24 Bn) 


of all symmetric rational functions; cf. Proposition 3. Since the elementary sym- 
metric polynomials s1,...,5, are algebraically independent over k, we can view 
them as variables and therefore introduce a k-isomorphism 


k(S1,...,Sn) > k(s1,...,5n) = K 
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via S; > (—1)’s;. Interpreting this as an identification, p(X) is transformed 
into the familiar polynomial 


n n 


F(X) =D o(-1) 5; X"9 = ] (xX — 1) € KX] 


j=0 j=0 


that was studied before. Therefore, p, just like f, is separable and irreducible, 
and admits G,, as its Galois group. Furthermore, L is obtained as a splitting 
field of p over k(S1,..., Sn). 


Similarly as we did with symmetric rational functions, one can study sym- 
metric polynomials. For this we restrict the automorphisms of k(T1,..., Tn) 
given by permutations of the variables to the subring k[7,...,7;,]. Just as in 
the case of rational functions, a polynomial f € k[Ti,...,T,] is called sym- 
metric if f is left fixed by all permutations 7 € G,,. Clearly, the elementary 
symmetric polynomials s9,..., Sn) are examples of symmetric polynomials. As 
a generalization of Proposition 3, we will prove the fundamental theorem on 
symmetric polynomials, although at this place only for coefficients in a field k. 
See 4.4/1 for a more general version. 


Proposition 5. For every symmetric polynomial f € k[T,,...,T,], there ea- 
ists a unique polynomial g € k[,S,,...,S,] in n variables Sy,...,S, such that 


f =9(1,---; Sn): 


Proof. The uniqueness assertion follows directly from the algebraic independence 
of the polynomials s1,...,s, over k, as established in Proposition 3. 

To settle the existence part, consider the lexicographic order on N”, where 
we write v < v’ for two tuples vy = (4,...,%), v’ = (,..-,4,) € N” if there 
is an index ip € {1,...,n} such that 4%, < vj,, as well as y% = vj for i < ig. 
Then, for every nontrivial polynomial f = ))cyn VT” € k(Ti,...,T,], the set 
{v © N”; c, £0} contains a lexicographically biggest element. Such an element 
is unique; it is called the lexicographic degree of f and is denoted by lexdeg(/). 
Now let f = Yo enn GT” be a symmetric polynomial with lexicographic degree 
lexdeg(f) = w = (t1,---;fn). Then we have 4, > Wo >... > fn, since f is 
symmetric. Furthermore, 


. Hi—H2 (M2—p3 : 
=o Ss ... 8h" Ek [s1,..., Sn] 


is a symmetric polynomial of total degree 
(141 — fa) + (pe — fas) + 3(ts — fa) +... + ptm = D> poi = |], 
i=1 


which, just like f, starts with c,7 as lexicographically highest term. This 
implies 
lexdeg(f — fi) < lexdeg(f), deg(f — fi) < deg(f). 
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If f is different from f;, we can repeat this step once more, replacing f 
by f — f;. Continuing in this way, we end up with a sequence of elements 
fi, fos... € k[S1,.-., Sn] such that the lexicographic degree of the sequence 


ii fF - ts f-h-s: ae 


decreases step by step, while at the same time, the total degree is bounded 
by deg(f). Therefore, the sequence will end after finitely many steps with the 
zero polynomial, thereby yielding a representation of f as a polynomial in the 
elementary symmetric functions s1,..., 5p. 


The proof of Proposition 5 yields a very effective principle to determine for 
a given symmetric polynomial f a polynomial g satisfying f = g(s1,..-, Sn). 
Note that the principle works quite generally over an arbitrary ring R instead 
of k as coefficient domain. For some examples one may consult Section 6.2, 
where we have to write certain special symmetric polynomials that occur in 
solving algebraic equations of degree 3 and 4 as polynomial expressions in the 
elementary symmetric polynomials. 

Finally, observe that the argument concerning the uniqueness part in the 
proof of Proposition 5 remains valid if we replace the coefficient field k by an 
integral domain R, for example by R = Z. This is enough (granting the existence 
assertion) to define for monic polynomials their discriminant as a polynomial 
in the corresponding coefficients; see Section 4.4, in particular 4.4/3. 


Exercises 


1. Show for every finite group G that there is a Galois extension L/K satisfying 
Gal(L/K) ~G. 
2. Consider a subfield L Cc C such that L/Q is a cyclic Galois extension of degree 4. 


Show that L/Q admits a unique nontrivial intermediate field E, and that E is 
contained in R. 


3. Let K be a field of characteristic # 2 and f € K[X] a separable irreducible 
polynomial with zeros a1,...,Q, in a splitting field L of f over kK. Assume that 
the Galois group of f is cyclic of even order and show: 


(i) The discriminant A = [];.;(ai — aj)? does not admit a square root in K. 
(ii) There is a unique intermediate field F of L/K satisfying [EF : K] = 2, 
namely E = K(V/A). 
4. Leta,BeC,a¥4 Bia 8, be zeros of the polynomial (X°—2)(X?+3) € QLX] 
and let L = Q(a,). Show that L/Q is a Galois extension, and determine its 
Galois group, as well as all intermediate fields of L/K. 


‘deg 
5. Consider DL = o( +) as a subfield of C and show that L/Q is a Galois 


v2 


extension. Determine the corresponding Galois group, as well as all intermediate 
fields of L/Q. 
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6. Determine the Galois groups of the following polynomials in Q[.X]: 
(i) X93 +6X?+11X +7, 
(ii) X°4+3X?—-1, 
(iii) X44 2X? — 2. 


7. Determine a splitting field and the Galois group of the polynomial X* — 5 over 
Q, resp. Q(z), as well as all intermediate fields of the arising extensions. 


8. Determine the Galois groups of the following polynomials: 
(i) X4- X?-3¢€F5[X], 
(ii) X44 7X? —3 € Fi3[X]. 


4.4 Symmetric Polynomials, Discriminant, Resultant* 


In the present section we extend the fundamental theorem on symmetric poly- 
nomials 4.3/5 to general coefficient domains, using an alternative method. As 
an application, we study the discriminant of a polynomial, characterizing it in 
terms of the resultant. The discriminant of a monic polynomial f € A [LX] with 
coefficients in a field kK vanishes if and only if f admits multiple zeros in an 
algebraic closure Kv of K; see Remark 3. Similarly, the vanishing of the resultant 
of two polynomials f,g € K[X] indicates that f and g have a common zero in 
K; see Corollary 8. 

In the following we use methods from linear algebra and work with mod- 
ules over rings, as introduced in Section 2.9. Modules generalize the concept 
of vector spaces over fields to arbitrary rings as coefficient domains. However, 
in the present section it is enough to know for a ring extension R C R’, or 
more generally, a ring homomorphism vy: R —> R’, that an additive subgroup 
M Cc R'is called an R-module if r € R, x € M always imply y(r)a € M, where 
for simplicity, we will write ra instead of y(r)«. In particular, R’ itself can be 
viewed as an R-module. A system (2;);e7 of elements in M is called a free sys- 
tem of generators of M if every x € M admits a representation x = ),¢,Tivi 
with coefficients r; € R that are unique and, of course, vanish for almost all 
indices 7. Free systems of generators of modules correspond to bases of vector 
spaces, although their existence is not always guaranteed in the module case. 

Let us consider the polynomial ring R[Ti,...,T,] in n variables T; over a 
ring R. As in Section 4.3, we view the permutation group G,, as a group of au- 
tomorphisms of R[T;,...,7;,], interpreting every 7 € G,, as the corresponding 
R-automorphism 


R(T, ae Tn] — R(T, fale Tn), 
is sie Tn) + fais mista La 
A polynomial f € R[T),...,T;] is called symmetric if it is invariant under 


all elements 7 € G,,. As special examples of symmetric polynomials, we know 
already the elementary symmetric polynomials 
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so = 1, 
Sy] =T; +...4+Th, 
52 = T,T> + TY Ts +...4+T,-1Th, 


Sn = T7432. Tn; 


which, using an additional variable X, are defined in terms of the equation 


(1) [hx-1) = Sic 1y¥egx. 
i=l j=0 
Clearly, the symmetric polynomials in R[T;,...,7;,] form a subring containing 


R as well as all s;. 


Theorem 1 (Fundamental theorem on symmetric polynomials). As before, con- 
sider the polynomial ring R[T] = R[T},...,T,] in n variables over a ring R 
and let 51,...,5, be the corresponding elementary symmetric polynomials. 

(i) The ring R[s1,...,5n] equals the subring of all symmetric polynomi- 
als in R[T], i.e., every symmetric polynomial in R[T] is a polynomial in the 
elementary symmetric polynomials 81,..., Sn. 

(ii) The elements s1,...,8, € R[T'] are algebraically independent over R 
(in the sense of 2.5/6). 

(iii) Let N CN" be the set of all n-tuples v = (14,...,Un), where0< yj, <i 
for 1<i<n. Then the system (T”),en is a free system of generators of R[T] 
as a module over R[s1,..., Sn]. 


Proof. We conclude by induction on n. The case n = | is trivial, since in 


this case, s; = T, and every polynomial in R[T,] is symmetric. Therefore, 
assume n > 1, and let s),...,5/,_, be the elementary symmetric polynomials in 


»°n-1 


R[T,...,Tn-1]. Then one deduces from the equation 


n n—-1 
So (1)! sj, X"4 = (X — Th)» S0(-1) sx" 
j=0 j=0 
the relations 
(2) 8; = 8; + 851Tn, 1<j<n-l, 


as well as sj = so = land s',_,T;, = Sp. Using an inductive argument, this shows 
that s/,...,s5/,_, can be represented in a similar way as linear combinations of 
the s1,..., 8, —1 with coefficients in R[T;,]. Thereby we see that 


(3) Fel Sie Seg O a cae Ul Ri bony See | 


n—-1? 


Moreover, we make the following claim: 


164 4. Galois Theory 


(4) The systems s/,..., 8), 4,Tn, as well as 81,...,8n—-1, Tn, are algebraically 
independent over R. 


To justify this claim we replace R by [Tn] and thereby may assume, due to 
the induction hypothesis, that s{,...,s/,_, are blecbencally independent over 
R[T,], or what amounts to the same, that s{,...,s',_, Tn are algebraically in- 
dependent over R. To obtain the corresponding fact for 51,..., 5,—1, T,, consider 
a nontrivial polynomial f in n—1 variables and with coefficients in R[T,,] such 
that f(si,...,Sn-1) vanishes in R[T\,...,7;,]. Since T;, is not a zero divisor 
in R[T1,...,7;,], we may assume that not all coefficients of f are divisible by 
T,. Now apply the homomorphism 7: R[T1,...,Tn] —> R[T1,...,Tn—-1] sub- 
stituting T,, by 0, which satisfies r(s;) = si, for 7 = 1,...,n —1, due to the 
relations (2). Using the fact that not all coefficients of f are divisible by T,, and 


hence are mapped to 0 by T, we obtain from f(s1,..., 5 —1) = 0 a nontrivial re- 
lation of type g(s},...,9/_ = =0in R[T,...,Tr-1]. However, this contradicts 
the fact that s{,...,s/,_, are algebraically independent over R by the induction 


hypothesis. Therefore, our claim (4) is justified. 

Let us turn now to the assertions of the fundamental theorem. To derive 
(i) consider a symmetric polynomial f in R[T\,...,7;,]. Since all homogeneous 
parts of f are symmetric as well, we may assume f to be symmetric of a certain 
degree m > 0. Using that f is invariant under all permutations of the vari- 
ables Tj,... Tn 1, we can read from the induction hypothesis that it belongs 


to R[s{,...,8',,,Tn] and hence, by (3), to R[s1,..., 8-1, Tn]. Now write f as 
(5) (Se 

with coefficients f; € R[s1,...,5n—-1]. Then every f; is a symmetric polynomial 
in T,,...,7;,, which, as we claim, is homogeneous of degree m — 7. To justify 


this, write the f; explicitly as sums of terms of type cs{!...s,"4'. Viewing such 
a term as a polynomial in 7},..., 7, it is homogeneous of een oe jv;, the 
so-called weight of the term. Furthermore, multiplication by T” yields a homo- 
geneous polynomial of degree 7 + St jv;. Let us write f/ for the sum of all 


terms cs{'...s;"7' in f; that are of weight m—i. Then we get f = > f/T", since 
f is homogeneous of degree m. However, 51,..-,Sn—1, 7, are algebraically inde- 
pendent over R by (4), so that the representation (5) is unique. Therefore, we 
must have f; = f/ for alli, and f;, as a polynomial in T),...,7;,, is homogeneous 
of degree m — i. 

In particular, the coefficient fo € R[s1,...,$n—1] is symmetric and homoge- 
neous of degree min T,,..., T;,. If (5) reduces to f = fo, we are done. Otherwise, 


consider the difference f — fo, which is symmetric and homogeneous of degree 
m in T\,...,7, as well, while 7), divides f — fo by construction. Then, by a 
symmetry argument, s, = 7,...7;, divides f — fp, and we can write 


(6) f = fot 98n, 


where g is symmetric and homogeneous of some degree < m in T},...,Th. 
Finally, induction on m yields f € R[s1,...,$,], as asserted in (i). 
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Next, we need an auxiliary result, whose proof will be given further below: 


Lemma 2. Consider the polynomial ring A[X] in a variable X over a ring A, 
and let h = cyX" +X" 14+...+c, be a polynomial in ALX] whose leading 
coefficient co is a unit in A. Then every f € ALX] admits a representation 
— = fiX' with unique coefficients f; € A[h]. Furthermore, every f; is of 
type f, = ee a;jh! with unique coefficients ay € A. 

In particular, h is algebraically independent over A, and X°,X',...,X 
form a free system of generators of ALX] viewed as a module over A[h]. 


n-1 


To approach assertion (ii) of Theorem 1 we go back to the polynomial (1) 
above, which admits T;, as a zero. Substituting X by T),, we get the equation 


n—-1 
(-1)""1s, = $0(-1)'sj, Tr) = Te — syTR 1 +... + (-1)" 1 5p-1Th. 
j=0 


Now apply Lemma 2 for A = R[51,...,Sn-1], X = Ty, and h = s,. We thereby 
see that s, is algebraically independent over R[s,...,S,-1] and hence that 
$1,---,5n are algebraically independent over R, since s1,...,Sn-1, are alge- 
braically independent over R by (4). Thus, assertion (ii) is clear. Deducing 
assertion (iii) from the lemma is just as easy. Indeed, the system 


S(t ail Ose) fe <7 1} 


forms a free system of generators of R[T),...,T;] over R[.$1,.--,8n—1, Tn], due 
to the induction hypothesis; to check this, use R[T;,] as coefficient ring and ap- 
ply (3). Furthermore, due to the lemma, ¥” = {T°,...,77~'} is a free system 
of generators of R[s1,...,8n-1, Tn] over R[si,...,,]. But then, by a stan- 
dard argument, ¥ = {a’a"; a’ € F,a” € F"} is a free system of generators of 


R[T,...,Tn] over R[s1,...,5n]. 


It remains to supply the proof of Lemma 2. To do this, we have to show 
that every polynomial f € A[X] admits a representation 


nl n—-1 
f= agh! = > dai X") 
i=0 j>0 j>0 i=0 


with unique coefficients a;; € A, or in other words, a representation 
(7) 0 
j20 


with unique polynomials r; € ALX] of degree degr; < n. To achieve this we 
use Euclidean division by h, which exists in A[X], since the leading coefficient 
of h is a unit; cf. 2.1/4. Hence, we can consider the sequence of decompositions 


f=fih+ro, fi=foh+ri, fe = fah+ro, cee 
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where 1o,71,... € A[X] are polynomials of degree < n. The degree of the 
f; strictly decreases, until we reach a point where deg f; < n = degh and 
hence f; = r;. Going backward and putting everything together, we arrive at a 
decomposition as stated in (7). To verify the uniqueness, consider an equation 
0 = js07ih’ for polynomials ro,r1,... € ALX] of degree < n. Using the 
uniqueness of Euclidean division, the decomposition 


O=roth- So rjhi 


j>0 


yields ro = 0, as well as S> j>0 rjh?* = 0. By induction we can conclude that 
r; = 0 for all 7. 


The proof of Theorem 1 suggests another practical principle for represent- 
ing symmetric polynomials in terms of elementary symmetric polynomials, a 
procedure that looks a bit more complicated than the one given in the proof of 
4.3/5. Substituting T;, by 0 in the equation f = fo(s1,-.-,5n—1) + gSn, see (6), 
we get 

f(Th, a Dts 0) = fo(si, PRES Si) 
using (2). This means that the problem of representing f as a polynomial in 
the elementary symmetric polynomials s;,...,5, is reduced to the following 
subproblems: 

(a) Consider the symmetric polynomial f(T),...,T,—1,0) inn — 1 variables 
and write it as a polynomial fo(s{,...,5/,_,) in the elementary symmetric poly- 
nomials s{,...,s/_, in T),...,7,-1 with coefficients in R. 

(b) Replace s{,...,s/, in fo by the corresponding elementary symmetric 
polynomials s1,...,8,-1 in T\,...,7;,, divide the difference f — fo(s51,.-.,Sn—1) 
by s,, and write s>'- (f — fo(s1,---,Sn—1)) aS a polynomial in the elementary 
symmetric polynomials s1,..., Sp. 

Step (a) allows one to reduce the number of variables, whereas (b) will 
reduce the degree of the symmetric polynomial under consideration. Thus, after 
finitely many steps of type (a) or (b) we end up with the desired representation 
of f. 

To give a first application of Theorem 1, we can prove again the asser- 
tion of 4.3/3 and show that every symmetric rational function in n variables 
T,,...,T, with coefficients in a field k can be written as a rational function 
in s1,...,5, with coefficients in k. Indeed, consider a symmetric rational func- 
tion g € k(Ti,...,Tn), say ¢ = f/g with polynomials f,g € k[Ti,...,Th]. 
Multiplying f and g by the product Tree, —fia} m™(g), we may assume that g is 
symmetric. But then f = q-g is symmetric as well. In particular, g is a quotient 
of symmetric polynomials, and hence by Theorem 1 (i), a rational function in 
$1,-.-,5n. Also note that the free system of generators of Theorem 1 (iii) gives 
rise to a basis of k(T),...,T,) over k(s1,..., Sn). 

To give another application of the fundamental theorem on symmetric poly- 
nomials, we want to discuss the discriminant of a monic polynomial. Working 
over the coefficient domain R = Z, consider 
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[[@-n" 


i<j 


as a symmetric polynomial in Z[T),...,7;,]. The latter is a polynomial A in the 
elementary symmetric polynomials s1,...,5,, as we can read from Theorem 1. 
One calls A = A(s;,...,8,) the discriminant of the polynomial 


n n 


[[&-1) = SC 1 sx", 


i=1 j=0 


where X is viewed as a variable over the ring Z[71,...,7;,]. Now, to define the 
discriminant of any monic polynomial f = X"+c,X""!+...+c¢, € R[LX] over 
any coefficient ring R, set 


A; = A(-c1, C2, 055; (—1)"cn). 


In this way, Ay equals the image of A under the ring homomorphism 


yp: Z[s1,..-,5n] —> R, 8; (-1}’c;, 


which extends the canonical homomorphism Z —>+ R. Observe when setting 
up y that the elementary symmetric polynomials s1,...,5,, are algebraically 
independent over Z and thus Z[s1,...,5,] can be viewed as the polynomial 
ring in the n “variables” s,,...,5,. Therefore, existence and uniqueness of y 
follow from 2.5/5. Furthermore, note that the definition of the discriminant A; 
is meaningful also for n = 0, i.e., for the monic polynomial f = 1. Indeed, we get 
A; = 1 in this case, since empty products assume the value 1 by convention. All 
in all, we see that the discriminant A; of a monic polynomial f is a polynomial 
expression in the coefficients of f and as such is an element of the coefficient 
ring R. 


Remark 3. Let R be a ring and f =X" +X" '+...4+¢, € R[X] a monic 
polynomial with discriminant Ar. If f = [[f_,(X — ai) is a factorization over 
a ring R' extending R, then 


Proof. Assuming there is a factorization of f as stated, consider the ring homo- 
morphism 

p: Z2[T1,...,Tr] — RB, TiO a, 
which extends the canonical homomorphism Z —>+ R. It maps the polynomial 
F = JJj_,(X — Tj) to the polynomial F° = f, so that y(s;) = (—1)%c; and 
hence y(Ar) = Ary. Therefore, we get 


as desired. 
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In particular, we thereby see for a monic polynomial f € K [|X] over a field 
kK that its discriminant A; vanishes if and only if f has multiple zeros in an 
algebraic closure K of K. This, in turn, is equivalent to the fact that f and its 
derivative f’ have common zeros in K, see 2.6/2, a fact that can be checked 
by looking at the resultant res(f, f’). More specifically, we want to show in the 
following that the discriminant Ay coincides with the resultant res(f, f’), up to 
sign. As we will see, such a characterization is of particular interest for explicit 
computations of discriminants. 

Now to introduce the resultant res(f,g) of two polynomials f,g, consider 
polynomials 


f =agX™ +a,X™ 1+... + Op, g = bg X" +6,X" 1 +... +b, 


in a variable X and with coefficients in a ring R. Since we do not require ap or 
by to be nonzero, we call m, resp. n, the formal degree of f, resp. g, as well as 
the pair (m,n) the formal degree of (f, g). Then the resultant res(f, g) of formal 
degree (m,n) is defined as the determinant of the following matrix consisting 
of m+n rows and columns: 


A ao as. : pe 
a ay. . > Gin 
TM fo etetitnt ted dead geeeedicheegdterk 
ag ay Am 
V ag ay Am 
(*) A |b by. bp, , 
bo by bn 
Wie Pt tetaessaeebcer a veceteventgen anaes chaning 
bo by. 2 « Op, 
V bo = by s a, os “DR 


where we have to enter zeros at free places. Note that the trivial case m = n = 0 
is not excluded, since the determinant of the empty matrix is 1 by convention. 
In cases in which the formal degree of f, resp. g, is not mentioned explicitly, 
we assume for f # 0 ¥ g that it is given by the degree of f, resp. g. Applying 
the common rules for handling determinants,? we can see immediately that the 
following must hold: 


Remark 4. Consider polynomials f,g € RLX] in a variable X over a ring R, 
as before. Then: 


(i) res(f,9) = (-D”™ res(g, f)- 
(ii) res(af, bg) = a"b™ res(f, g) for constants a,b © R. 


3 Formal rules for computations in terms of determinants remain valid if instead of coeffi- 
cients in a field, we admit coefficients in a more general ring R. Indeed, in a first step, we can 
view the necessary coefficients, say ¢1,...,C,, as variables and observe that the corresponding 
rule holds over the rational function field Q(ci,..., cy) or its subring Z[ci,...,c,]. The rule 
can then be transported to arbitrary rings R by considering suitable ring homomorphisms 
Z[c1,...,;¢r] —> R. 
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(iii) For a ring homomorphism y: R —> R' and the polynomials f?, g? 
transported via p to R'[X], we get res(f?, 9°) = y(res(f, g)). 


Let S' be the transpose of the matrix (*). We want to realize S as an R-linear 
map and accordingly interpret res(f, g) as the determinant of this map. To do so, 
let RLX]; for i €¢ N be the R-module of all polynomials in RLX] of degree < i. 


Lemma 5. Let f,g © RLX] be polynomials as before. 
(i) Fix X*1,...,X° as a free system of generators of R[X]; over R for 
each i. Then the transpose S' of the matrix (*) corresponds to the R-linear map 


®: R[X]n X RLEX]m — RLX] min; (u,v) —> uf + v9. 


(ii) If f is monic of degree m, then along with § = (X™""1,...,X°), 
also 3 = (fX"™1,...,fX°,X™1,...,X°) is a free system of generators of 
RLX] min over R. The corresponding change of basis matrix from § to §’ admits 
determinant 1. 

(iii) If f ts monic of degree m, then res(f,g) equals the determinant of 
the R-endomorphism ®': R[X]m+n —> R[LX]min that is given by multiply- 
ing X™"!,...,X° by g and otherwise leaving fX""',..., fX° fixed. 


Proof. Assertion (i) is immediately clear, since the coefficient vectors of the 
@-images of 
OP) Dyce COO aoe OE”) 


coincide with the rows of the matrix (*) and hence with the columns of the 
matrix S. 

Turning to assertion (ii), represent the members of §’ as linear combinations 
of the free system of generators §. The corresponding coefficient matrix is a lower 
triangular matrix, whose diagonal entries are all equal to 1; use the fact that f 
is monic. In particular, this matrix has determinant 1 and hence is invertible. 
Therefore, ’ is a free system of generators of R[X]m4n, and the basis change 
matrix from ¥ to §’ admits determinant equal to 1. 

In order to verify (iii), check that the map @ is described by the matrix S$ 
if we fix on the source §’ and on the target § as free systems of generators. To 
compute the determinant of & in terms of the determinant of a corresponding 
matrix, we have to consider, on both the source and the target, the same free 
system of generators. For example, we can change from § to ¥ on the source. 
However, using (ii), the corresponding basis change matrix is of determinant 1, 
so that we get det &’ = det S = res(f,g). 


Let us draw some conclusions from the lemma. 


Proposition 6. Let f,g € R[X] be polynomials as before and assume that 
m+n > 1. Then there are polynomials p,q € RLX], degp < n, degq < m, 
such that res(f,g) = pf +g. 
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Proof. We consider the map ® of Lemma 5 and claim that res(f,g), viewed as 
a constant polynomial in R[X], belongs to the image of &. To justify this, we 
use Cramer’s rule 
S+S* = (det S)-E, 

where $* is the adjoint matrix of S$, and E the unit matrix with m+n rows and 
columns; see, for example, [4], Satz 4.4/3, in conjunction with the generalization 
we have given in the proof of 3.3/1. Translated to R-linear maps, the above 
means: there exists an R-linear map ®* : R[X]min —> R[X]m x R[X],, whose 
composition with the map ®: RLX]|m x RLX]n —> R[X]min yields the map 
(det S)-id. In particular, the constant polynomial @o @*(1) = det S = res(f,g) 
belongs to the image of ®, which justifies our claim. 


In particular, we can read from Proposition 6 that res(f,g) vanishes if f 
and g have a common zero. However, the resultant can vanish in additional 
cases, for example, when the leading coefficients ag, by) of f and g are both zero. 
Next we want to prove a characterization that serves as a key ingredient for the 
further study of resultants. 


Proposition 7. Let f € R[X] be a monic polynomial of degree m. View the 
residue class ring A = R[LX]/(f) as an R-module under the canonical map 
R — R[X]/(f) and write x for the residue class of X. Then the powers 
x”, ...,2° form a free system of generators of A as an R-module. Further- 
more, let g € R[X] be a polynomial of degree <n and g(x) its residue class in 


A. Then the resultant of formal degree (m,n) satisfies 


res(f,g) = Nar(g(2)), 


where N ajr(g(x)) ts the norm of g(x), i.e., the determinant of the R-linear map 
— A,ar > g(x)-a. 
In particular, res(f,g) is independent of the choice of the formal degree n 
of g. 


Proof. Since f is monic, we can use Euclidean division by f in R[X], which 
is unique; see 2.1/4. Therefore, the projection RX] —> A induces an iso- 
morphism of R-modules R[X]m > A, and it follows that the elements 
x”! ..., 2°, being the images of X”~!,..., X°, form a free system of genera- 
tors of A over R. This settles the first assertion of the proposition. To derive the 
second, consider the map & of Lemma 5 (iii). The projection RLX]m4n —> A 
admits fR[X], as its kernel. Since &’ maps this kernel into itself, it follows 
that ' induces an R-linear map @: A —> A, which clearly is multiplication by 
g(x). Since & restricts on f R[X], to the identity map, we get det & = det &, 
ie., we have res(f,g) = Nasr(g(x)), due to Lemma 5 (iii). 


Corollary 8. Let f,g be nontrivial polynomials with coefficients in a field K. 
Furthermore, let deg f = m and degg < n. Then the following conditions are 
equivalent: 


4.4 Symmetric Polynomials, Discriminant, Resultant* 171 


(i) The resultant res(f,g) of formal degree (m,n) is nonzero. 
(ii) The polynomials f and g do not admit common zeros in an algebraic 
closure K of K. 


Proof. Following Remark 4, we may assume f to be monic. If res(f,g) 4 0, we 
see from Proposition 7 that the determinant of the multiplication by g(a) is 
nonzero on K[X]/(f) and hence invertible. Then the multiplication by g() is 
invertible itself, which implies that g(a) is a unit in A [X]/(f). Therefore, f and 
g generate the unit ideal in K [|X], and it is impossible that these polynomials 
have a common zero in an extension field of A’. Conversely, assume that f and g 
are without common zeros in K. Then f and g are prime to each other in K[X] 
and therefore also in K [|X]. Hence, there is an equation of type uf +g = 1 for 
certain polynomials u,v € K[X], and we see that g(a) is a unit in K[X]/(f). 
In particular, the multiplication by g(a) is invertible on K[X]/(f), and the 
corresponding determinant is nonzero. But this implies res(f,g) 4 0, due to 
Proposition 7. 


Corollary 9. Let 


f=a][(x-o),  9=8]](x - 4), 


be factorizations of two polynomials f,g € R[X] with constants a, 8 € R, as 
well as zeros Q1,...,Qm, 31,-.-,;8n belonging to a ring R! extending R. Then 
the resultant of formal degree (m,n) satisfies 


res(f,g) = 0” Tf g(a) = 08" TT (ai — 6). 
a lejen 


Proof. Making use of Remark 4, we may assume R = R’, as well as a = 1 = 8, 
and hence that f and g are monic polynomials admitting a factorization into 
linear factors in R[X]. It is easily checked that res(X — a;,g) = g(a;), either 
by going back to the definition of the resultant or by applying Proposition 7. 
Moreover, we conclude from the multiplicativity of the norm of elements in 
R[X]/(f) over R, in other words from the multiplicativity of the determinant, 
that any two polynomials g1, 92 € R[X] satisfy the relation 


res(f, 9192) = res(f, 91) - res(f, ga). 


Then, applying Remark 4 (i), we can derive for polynomials f,, fo in R[X] of 
degree deg f; < m; the relation 


res(fifo,9) = res( fi, 9) . res(f2, 9), 


where the resultants are meant with respect to the formal degrees (m,+ mz, 7), 
resp. (m,n) and (m2,n). Applying the above equation repeatedly to the fac- 
torization of f yields the desired assertion. 
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Now let us characterize the discriminant in terms of the resultant, as was 
already announced before. 


Corollary 10. Let f € R[X] be a monic polynomial of degree m > 0 and f' 
its derivative. Then the discriminant A; is related to the resultant res(f, f’) of 
formal degree (m,m— 1) by 


As = (-1)™""9? res(f, f'). 


Alternatively, let A = R[X]/(f) and write « € A for the residue class of the 
variable X € R[X]. Then 


Ay = (-1)™ "VPN ae(f'(2)). 


Proof. The second equation follows from the first one by means of Proposition 7. 
To derive the first equation, we may replace R by a suitable extension ring; use 
the definition of the discriminant in conjunction with Remark 4 (iii). Thereby 
we may assume that f decomposes over R into a product of linear factors. 
Indeed, using the idea of Kronecker’s construction 3.4/1, we can replace R ina 
first step by R’! = R[X]/(f). Then f admits at least one zero in R’, namely the 
residue class % of X. Dividing out the linear factor X — % from f, we obtain a 
monic polynomial of degree m—1, which can be treated in the same way again, 
so that after finitely many steps, we arrive at a ring extending R over which f 
decomposes completely into a product of linear factors. 
Therefore assume f = ]]j",(X — a;), so that 


m 


i=1 
by Corollary 9. Furthermore, the product rule for derivatives yields 


m 


fi = 0(X ay)... (X — a4-1)(X — a441)... (X — Om) 


w=1 
and hence 
f'(ai) = (a; = Q1) ee (a; ai Qi-1) (Qi; 7 Qi41) bane (ay; = Qm)- 


However, this means that 


res(f, f’) = [](ai — ay) = (19? TY (ai — a)? = (1m Ay, 


iAj i<j 


as claimed. 


Observe that Corollary 10 provides in particular an explicit formula for 
determining discriminants. Let us look at a simple example and compute the 
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discriminant of the polynomial f = X° + aX +b € R[X], which was needed 
in Section 4.3 for the case of a field K = R in order to study the Galois group 
of f. Following Corollary 10, we get 


Ay = —res(f, f’) = — det 


oo wor 
OW or © 
wos os 
oa on om 
geooco 


To simplify the computation of the determinant, one should subtract 3 times 
row 1 from row 3, as well as 3 times row 2 from row 4. Thereby we get 


—2a -—3b 0 
A; =—det | 0 2a —3b} = —4a? — 270°. 
3 0 a 


Following the same procedure, one shows for m > 2 that the discriminant of 
the polynomial f = X™ + aX + b is given by 


A; = (—1ymer-"YP (1 = m)m™ta™ 4. mirpr—1) 
Also the case f = X4+ aX? + bX +c is quite simple. We obtain 
Ay = 144ab?c — 128a7c? — 4a°b? + 16a*c — 27b4 + 256c%, 


where similarly as in the preceding examples, one reduces the computation 
of the corresponding seven-row determinant to a four-row determinant, using 
elementary row transformations. 

Finally, let us point out that discriminants are usually considered in a some- 
what more general setup. Let R C A be a ring extension such that A, as an 
R-module, admits a finite free system of generators e1,...,€n. Then, just as for 
vector spaces, one can define the trace of R-linear maps y: A —> A. Indeed, 
if y is described by the matrix (a;;) € R"*", with respect to a free system of 
generators of A over R, then one defines the trace of y by tr(y) = Oy, ax € R.- 
In particular, one can consider for a € A the R-linear map 


Ya: A— A, ct— ax, 


given by multiplication by a. Its trace is denoted by tra/pr(a) and, more specif- 
ically, is called the trace of the element a € A with respect to the extension 
A/R; see also Section 4.7. 

Now, if 21,...,%, is a system of elements in A, one looks at the matrix 


aaees 


respect to the ring extension R C A by 


Dajr(41,---,En) = det (trae (tix;)); 
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To relate the discriminant of a polynomial Af, as defined before, to the discrim- 
inant of a system of elements Dy/p(a1,--.,%p), we prove the following result: 


Proposition 11. Let f € RLX] be a monic polynomial of degree n > 0. As in 
Proposition 7, consider A= R[LX]/(f) as an R-module with the free system of 
generators x°,...,2"~', where x € A is the residue class of X € R[X]. Then 


A; = Dajr(x®, i ga) 


Proof. We start with a reduction step showing that the asserted equation needs 
to be established only for certain rings R and polynomials f. Consider a ring 
homomorphism +: R —>+ R together with a monic polynomial f € R[X ] of 
degree n that is transported via 7 to f. Then 7 gives rise to a ring homomor- 
phism oo 7 

ot: A= REX]/(f) 3 RIXI/(f) =A. 


nm 


aes, 
free system of generators of A as an R-module, just as x°,...,2"~+ form a free 
system of generators of A as an R-module. Furthermore, we have 7/(z') = 2’ for 
i=0,...,n—1. We want to relate the multiplication by an element a € A, 18, 
the R-linear map Pa: A — A, tothe corresponding R-linear map ~,: A — A, 
where a = 7'(@). Since 7’ is a ring homomorphism, we see that the matrix M, 
describing y, (with respect to the free system of generators x°,...,2"~') is 
obtained from the corresponding matrix Mg of ya by transporting it via 7 from 
R*” to R™”", We therefore get trayr(a) = 7(tr4/R(@)), and in particular, 


Writing % for the residue class of X in R[X]/(f), we can view @°,...,% 


Dayr(x°, Sa a") = (Dayal, _ or"), 
In addition, the definition of discriminants shows that 
As =7(4j). 


As a conclusion, if the assertion of the proposition is known for R and 7 , it is 
known for R and f as well. 
Now consider the diagram of ring and field extensions 


A 0 a 6) Ge! 


Z[s1,---,8n] C Q(s1,-.-, Sn), 


where s1,...,5, are the elementary symmetric polynomials in T),...,7;,. Fur- 
thermore, consider a monic polynomial f = X" +X"! 4+...+¢n € RLX], 
for an arbitrary ring R, as required in the assumption of the proposition. Then 
we can define a ring homomorphism Z[s;,...,8,] —+ R by s; + (—1)%c;. 
The reduction step discussed before shows that the assertion of the proposition 
needs to be proved only for R = Z[s1,...,5n] and f = }0_9(-1)’s;X"7. In 
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addition, since Z[s1,...,8n] is a subring of Q(s1,...,,), we may even assume 
R = Q(s1,.--,5,). Then, as a polynomial in Q(s1,...,8,)[X], we know from 
4.3 that f is irreducible. Therefore, all in all, it is enough to consider the case 
that R = K is a field and f is a monic irreducible polynomial in K LX]. Also 
note that L = A = K[X]/(f) is then a field that is finite over AK. In the 
following, we will consider such a situation. 

By its definition, the discriminant of x°,..., x 


n 


~1 is given by 


Di jx (2°, er a det(trejx(2")), 9 not 


In order to compute the trace trz/q(x'*’), we decompose f over an algebraic 
closure of L into linear factors, say f = [],;_,(X — ax), and anticipate a result 
to be proved later in Section 4.7, namely 4.7/4, from which we can read 


n 
trrjx(a’’) = >» Oi 
k=1 


Now consider Vandermonde’s matrix V = (cen ere that is attached to 
Q1,...,Qpy. Since det V = Tica; —a;) and 


(trz/x (2‘*?)) = yt : V, 


i,j=0,...,.n—1 


it follows that 


Drjx(a®,...,2""") = (det V)? = | J (ai — 0%)? = Ay. 


i<j 


Exercises 


1. Explain the role played by the fundamental theorem on symmetric polynomials in 
defining the discriminant Ay of a polynomial f =X" + a,X"-! +... +4. Why 
should one consider symmetric polynomials with coefficients in rings, even if the 
discriminant is needed only for polynomials over fields? 


2. Look at the polynomial ring R[7),7>,73] over a ring R and write the symmet- 
ric polynomial T3 + TS + re as a polynomial in the corresponding elementary 
symmetric polynomials. 

3. For a ring R, verify the following formulas for discriminants Ay of polynomials 
f € RLX]: 

(i) f =X*+aX +), 
Ay= a? — 4b. 
(ii) f=X™+ aX +b for m> 2, 
Ay = (1-92 (0 = mya pe). 
(ii) f = X93 +aX2+bX +0, 
Ay = ab? + 18abe — 4a%c — 4b? — 27’. 
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(iv) f =X4+4+aX?+0X +6, 
Ay = 144ab?c — 128a7c? — 4a°b? + 16a*c — 27b* + 2563. 
4. Let R be a ring. For two polynomials f,g € R[X] of formal degree m,n, deter- 
mine the corresponding resultant res(f,g) in the following cases: 
(i) g = 90 € R (constant polynomial), m = 0. 
(ii) g = 90 € R (constant polynomial), m= 1. 
(iii) f=aopX +41, g=bX +h1, m=n=1. 


5. Show for polynomials over a ring R that 


res(ag.X? + a,X + ag, bo X? + b,X + be) 
= (aob2 = abo)” + (ayb2 = a2b1)(a1bo = agby). 


6. Show for monic polynomials f,g € R[X] over a ring R that 
Ajg = As Ag: res(f, 9)” 


7. Let R be a ring and f € R[X] a monic polynomial. Show for c € R that the 
discriminants of the polynomials f and g = f(X +c) coincide. 


4.5 Roots of Unity 


Let us fix a field K and an algebraic closure K of it. The zeros of the polynomial 
X”" —1, where n € N—{0}, are called the nth roots of unity (in K). These roots 
form a subgroup U, Cc K*. If char K = 0, or more generally, if char K does 
not divide n, then X" — 1 and its derivative D(X" — 1) = nX"~' do not 
admit common zeros. This shows that X” — 1 is separable in this case, so that 
ord U, = n. Otherwise, for p = char K > 0, consider a factorization n = p'n’ 
where p{n’. Then the polynomial X” — 1 is separable, as explained before, and 
its zeros coincide with those of X"—1, since X"—1 = (X"’—1)?". Hence, we get 
U,, = U,, and therefore ord U,, = n’. In particular, for the study of the groups 
U,, we may restrict ourselves to the case char A {n. Our main objective in the 
following is to study field extensions that occur by adjoining roots of unity. To 
begin with, let us reformulate the result 3.6/14: 


Proposition 1. Let n © N—{0} be an integer such that char K {n. Then the 
group U,, of nth roots of unity in K is cyclic of order n. 


A root of unity ¢ € U,, is called a primitive nth root of unity if it gener- 
ates the group U,. For example, the fourth roots of unity in C are given by 
1,7,-1,-7, while i and —i are the primitive ones among them. All roots of 
unity in C are of absolute value 1 and hence are situated on the unit circle 
{z €C; |z| =1}. The nth roots of unity in C can easily be described in terms 
of the complex exponential function. They consist of all powers of the primitive 
nth root of unity ¢, = e?"/", as is illustrated for n = 6 by the following figure: 
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In view of the complex case, we can say that the roots of unity split the unit 
circle line into equal parts. Extension fields of Q constructed by adjoining a 
root of unity are referred to as cyclotomic fields, from Greek “kiklos” (circle) 
and “tomé” (a cutting). 


Remark 2. Let m,n € N— {0} be prime to each other. Then the map 
h: Un X Uy, — Umn; (¢,) —> Cn, 


is an isomorphism of groups. If Gn € Um is a primitive mth root of unity and 
cn € Un a primitive nth root of unity, then CnC, 1s a primitive mnth root of 
unity. 


Proof. We may assume that char Kk does not divide mn. Since U,, and U;, are 
subgroups of Um», we see that h is a well-defined homomorphism of commutative 
groups. Furthermore, U,,, and U,, are cyclic by Proposition 1, of orders m and 
n. Now if Gy, € Um is a primitive mth root of unity and ¢, € U, a primitive 
nth root of unity, we see from 3.6/13 that (Gn,¢n) € Um x Un, as a product of 
(Cm; L) and (1, ¢,), is of order mn and thus generates the group U,, x U,. In the 
same way we conclude from 3.6/13 that Gm¢n € Umn is of order mn and thus 
is a primitive mnth root of unity generating U,,,. In particular, we thereby see 
that h is an isomorphism. 


Next we want to look more closely at the primitive ones among the nth 
roots of unity. To determine their number we need to introduce a function that 
is referred to as Euler’s y-function. 


Definition 3. For n € N — {0} let y(n) denote the order of the multiplicative 
group (Z/nZ)*, i.e., of the group of units of the residue class ring Z/nZ. The 
map py: N— {0} —> N is called Euler’s y-function. 


Remark 4. (i) Let n € N—{0}. Then 


(Z/nZ)* = {@ € Z/nZ; a € Z such that ged(a,n) = 1} 
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and hence 
y(n) = #{a EN; 0<a<nsuch that gcd(a,n) = 1}. 


(ii) If m,n € N— {0} are prime to each other, then p(mn) = y(m)y(n); 
this property is referred to as the multiplicativity of the y-function. 

(iii) Consider a prime factorization n = p{'...py" with exponents v, > 0, as 
well as distinct prime numbers p,. Then 


“Tee, _ 1). 


Proof. For elements a € Z and n € N — {0}, we have gcd(a,n) = 1 if and 
only if aZ+nZ = Z, ie., if and only if there are elements c,d € Z such that 
ac + nd = 1; cf. 2.4/13. This is equivalent to the fact that the residue class of 
a is a unit in Z/nZ, which verifies assertion (i). 

To establish (ii), use the Chinese remainder theorem in the version 2.4/14. 
It provides for numbers m,n € N — {0} that are prime to each other, an iso- 
morphism of rings 

Z/mnZ ~ Z/mZ x Z/nZ, 


and thereby an isomorphism between the corresponding groups of units 
(Z/mnZ)* > (Z/mZ)* x (Z/nZ)*, 


which implies the multiplicativity of the y-function. 

To verify (iii) we may apply (ii). Then it is only necessary to determine 
y(p”) for a prime number p and a positive exponent v. Since the products 
0-p,1-p,...,(p’-+ — 1)-p represent precisely those natural numbers d such 
that 0 <d< p’ and d is divisible by p, we conclude from (i) that 


p(p") =p’ — p’ | =p’ \(p— 1), 


as desired. 


Proposition 5. Let n € N. An element @ generates the additive cyclic group 
Z/nZ if and only if @ is a unit in the residue class ring Z/nZ. In particular, if 
n #0, then Z/nZ contains precisely p(n) elements generating Z/nZ as a cyclic 


group. 


Proof. Clearly, Z/nZ is generated by @ if and only if the residue class I of 1 € Z 
is contained in the cyclic subgroup generated by @. This is the case if and only 
if there is some r € Z satisfying 1 =r-@=T-@, i-e., if and only if @ is a unit 
in Z/nZ. 


Corollary 6. Let K be a field and n € N—{0} an integer such that char K fn. 
Then the group U, of nth roots of unity contains precisely p(n) elements that 
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are primitive. If ¢ € Un is such a primitive nth root of unity, then ¢" for some 
integer r € Z is a primitive nth root of unity as well if and only if the residue 
class of r modulo n is a unit in Z/nZ, i.e., if and only if gcd(r,n) = 1. 


Proof. Following Proposition 1, the group U,, is isomorphic to Z/nZ. There- 
fore, U, contains precisely y(n) primitive nth roots of unity, as we can read 
from Proposition 5. Furthermore, for a primitive nth root of unity ¢ € U,, the 
homomorphism Z/nZ —+ U,, mapping the residue class I € Z/nZ to ¢ is an 
isomorphism. Hence, a power ¢" for some r € Z is a primitive nth root of unity 
if and only if its preimage in Z/nZ, namely the residue class 7, generates the 
group Z/nZ. Due to Proposition 5, the latter is the case if and only if F is a 
unit. 


If ¢, € K is a primitive nth root of unity, the field K(¢,) contains all nth 
roots of unity. Hence, it is a splitting field of the polynomial X"—1 € K [LX] and 
is therefore normal over K. In addition, since X” — 1 is separable for char K fn, 
we recognize K(¢,)/K as a finite Galois extension. For kK = Q, the field Q(¢,) 
is called the nth cyclotomic field. 


Proposition 7. Let K be a field and G, € K a primitive nth root of unity, 
where char K{n. Then: 
(i) K(¢,)/K is a finite abelian Galois extension of a degree dividing p(n). 
(ii) Every o € Gal(K(¢,)/K) induces by restriction an automorphism of the 
group U,, of nth roots of unity, and the map 


w: Gal(K(¢,)/K) — Aut(U,), at olp,, 


is a monomorphism of groups. 
(iii) The automorphism group Aut(U,,) is canonically isomorphic to the mul- 
tiplicative group (Z/nZ)*, namely via the map 


(Z/nZ)* —+ Aut(U,), Fr> (Cr >C), 
which is well defined. 


Proof. We know already that K(¢,)/K is a finite Galois extension. Using as- 
sertions (ii) and (iii), we may view the Galois group Gal(/<(¢,)/K) as a sub- 
group of (Z/nZ)*. In particular, Gal(K(¢,,)/K) is abelian of some order dividing 
ord(Z/nZ)* = p(n). Therefore, assertion (i) is a consequence of (ii) and (iii). 

To approach (ii) observe that every Galois automorphism o € Gal(K(¢,)/K) 
maps zeros of X" — 1 to such zeros again. Since in addition, o is multiplicative 
and injective, we recognize oly, as an automorphism of the finite group Up. 
Thus, ¢ +> oly, defines a group homomorphism Gal(K(¢,)/&) —> Aut(U,), 
which is injective, since every automorphism o € Gal(K(¢,)/K) leaving ¢, € Un 
fixed must coincide with the identity map. 

Finally, to establish assertion (iii), we use the fact that U,, is cyclic of order 
n and hence is isomorphic to Z/nZ. Then, writing 7 and @ for the residue classes 
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in Z/nZ of elements r,a € Z, we have to show that the map 
w': (Z/nZ)* — Aut(Z/nZ), TH+ (@+>1r-@), 


is a well-defined group isomorphism. First, observe that 7)’ is well defined, since 
we have r-@ = 7G =7-G. Furthermore, it is clear that multiplication by a unit 
7 € (Z/nZ)* yields an automorphism of the additive group Z/nZ and that 
is injective. To show that vy’ is surjective as well, consider any automorphism p 
of the additive group Z/nZ and let F = p(1). Then, for a € Z, we can write 


p(@) = p(1-a) = p(1)-a=7-G, 


and it turns out that p coincides with multiplication by 7. Since such a multi- 
plication can be surjective only for units FT € Z/nZ, we get FT € (Z/nZ)* and 
hence p = w(T), so that w’ is seen to be surjective. 


Proposition 8. Let ¢, € Q be a primitive nth root of unity. The nth cyclotomic 
field Q(¢,,) is a finite Galois extension of Q of degree [Q(¢,) : Q] = y(n) with 
Galois group 


Gal(Q(Gn)/Q) 2 Aut(U,) ~ (Z/nZ)*. 


Proof. Since [Q(¢n) : Q] = deg f, where f € Q[X] is the minimal polynomial 
of ¢, over Q, it is enough to show that deg f = y(n). The characterization of 
the Galois group Gal(Q(¢,)/Q) is then a consequence of Proposition 7. Also 
note that 

[Q(Gn) : Q] = deg f < y(n), 
again by Proposition 7. 

We claim that every primitive nth root of unity ¢ € U, is a zero of f, which 
then implies deg f = y(n) by means of Corollary 6. To justify this claim observe 
that f, being the minimal polynomial of ¢,,, will divide X” — 1. Hence, there is 
a polynomial h € Q[X] satisfying 


HPSS Poke 


Since f is monic by definition, the same is true for h, and we have f,h € Z[X], 
due to 2.7/6. 

Now let p be a prime number such that p{n. Then ¢? is a primitive nth root 
of unity by Corollary 6, and we want to show that it is a zero of f. If such is not 
the case, i.e., if f(¢?) #0, we must have h(¢?) = 0. In other words, ¢, is then a 
zero of h(X?). This, in turn, implies f|h(X”), say h(X”) = f -g, where just as 
before, g is a monic polynomial in Z[X] by 2.7/6. Now, reducing coefficients 
modulo p, we consider the homomorphism 


2X] — Z/p2[X] =F,[X], oa X*-3 SO axs, 


extending the canonical projection Z —> Z/pZ. Then AP = A(X?) = fg 
shows that h and f cannot be prime to each other in F,[X]. Therefore, the 
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polynomial X" — 1 = f-h € F,[X] admits multiple zeros in an algebraic 
closure of F,, which, however, is in contradiction to p{n. Consequently, the 
assumption f(¢?) 4 0 above is untenable, and ¢? must be a zero of f. 

Now we can show for an arbitrary primitive nth root of unity ¢ that it is 
a zero of f. Assume ¢ = ¢7", where gcd(m,n) = 1, due to Corollary 6. Then 
we can obtain ¢ from ¢, by taking successive powers with prime exponents not 
dividing n. Therefore, a repeated application of the argument explained above 
shows that f(¢) = 0. 


For relatively prime integers m,n € N — {0} and primitive mth, nth, and 
mnth roots of unity Gm;Cn; Gmn € Q, the decomposition 


(Z/mnZ)* > (Z/mZ)* x (Z/nZ)* 


discussed in the proof of Remark 4 (ii), together with the assertion of Proposi- 
tion 8, yields a decomposition of Galois groups 


Gal(Q(Gnn)/Q) + Gal(Q(4m)/Q) x Gal(Q(Gn)/Q), 
which we want to study in more detail. 
Corollary 9. Let Gn,Gn € Q be primitive mth and nth roots of unity, where 
gcd(m,n) = 1. Then 
QCm) A ACn) = Q, 
and the map 


Gal(Q(Gms Gn)/Q) —+ Gal(Q(¢m)/Q) x Gal(Q(¢n)/Q), 
7 —> (FQ6m)> TlOGn))s 


is an isomorphism. 


Proof. We know from Remark 2 that Gun = GnGn is a primitive mnth root of 
unity. Therefore, the composite field of Q(G,) and Q(¢,) in Q is given by 


Q(Gm) -QGnr) _ Q(Gmn)- 


There is the following diagram of field extensions, 


y(n) p(m) 
Q(Gm) Qn) 
v(m) y(n) 


where the degrees of Q(Gnn), Q(Gn), and Q(G,) over Q are given by y(mn), 
y(m), and y(n), due to Proposition 8. Furthermore, y(mn) = v(m) - y(n) 
implies 
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[Q(Gnn) : Q(Cm)]| = p(n), [Q(mn) : QGn)] = p(m). 


Now let L = Q(Gn) MN Q(¢n). Since we have Q(Gnn) = Q(Gm; Gn), it becomes 
clear that ¢,, is of degree v(m) over Q(G,) and hence of degree > y(m) over L. 
Then the estimate 


elm) = [O(6m) : Q] = [OGm)  L] - [Z: Q] = elm): [LQ] 


implies [Z : Q] = 1 and thus L = Q. In particular, applying 4.1/12 (ii), we 
see that the map specified in the assertion provides an isomorphism between 
Gal(Q(Gnn)/Q) and the Cartesian product of the Galois groups Gal(Q(Gn,)/Q) 
and Gal(Q(¢n)/Q). 


Next let us factorize the polynomial X” — 1, which admits the nth roots of 
unity as its zeros, into so-called cyclotomic polynomials. 


Definition 10. Let K be a field. For n € N—{0} and char K'fn, let G,.-., Goin) 
be the primitive nth roots of unity in K. Then 


y(n) 


®, = [[«-@ 


i=1 


is called the nth cyclotomic polynomial over K. 


Proposition 11. (i) &, is a monic separable polynomial in K(X] satisfying 
deg, = o(n). 

(ii) If K =Q, we have ®, € Z[X], and ®,, is irreducible in ZLX], as well 
as in QUX]. 


(iii) X"-1= [[ 
d|n,d>0 
(iv) If ®, € ZX] is the nth cyclotomic polynomial over Q, then the nth 
cyclotomic polynomial over any other field K where char K{n is obtained from 
®,, by applying the canonical homomorphism Z —> K to the coefficients of ®,. 


Proof. Concerning (i), we have only to show that ®,, which does not have 
multiple zeros and hence is separable, admits coefficients in Kk’. To justify this, 
observe that L = K(¢;) = K(G,.--,C,(n)) is a finite Galois extension of kK by 
Proposition 7, and that we have &, € L[X] by definition. Since every Galois 
automorphism o € Gal(L/K) induces a self-map and hence a permutation on 
the set of primitive nth roots of unity, we conclude that @,, is invariant under 
Gal(L/K). But then @, € K[X], due to 4.1/5(i). 

Now assume KK = Q. Since every primitive nth root of unity ¢ is of degree 
y(n) over Q and since &, € Q[X] is a monic polynomial of degree y(n) sat- 
isfying ®,(¢) = 0, we see that ®, must coincide with the minimal polynomial 
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of ¢ over Q; this polynomial is irreducible. Furthermore, @,, is a monic polyno- 
mial dividing (X” — 1), and we get ®, € Z[X] using 2.7/6. Of course, ©,, is 
irreducible in Z[_X] as well, see 2.7/7, so that assertion (ii) is clear. 
Next, the formula in (iii) is obtained by combining factors of the decompo- 
sition 
x”-1= [[ (Xx-¢). 


CEUn 


Indeed, let P; for d € N—{0} be the set of primitive dth roots of unity in K. 
Then U,, is the disjoint union of all Pi, where d|n, d > 0. Therefore, we get 


xe-1= Jf [[%-o= JJ %. 
d|n,d>0 ¢€Pa d|n,d>0 


Finally, to derive assertion (iv), we write ®, for the nth cyclotomic polyno- 
mial over Q, as well as @,, for the corresponding polynomial over any other field 
K under consideration. We have to show that the canonical homomorphism 
7: Z[X] —> K[X] maps ,, to py, ie., that 7(,) = ,, which we shall prove 
by induction on n. The case n = | is obvious, since 


T(®,) =X-1l1= D. 
Therefore, assume n > 1. Then we have over Z the equation 


xX*°=1=46,. Il Py, 


d|n,0<d<n 
as well as over K the equation 
7 “ 2 
X*-1=6.. II Dy. 
d\n,0<d<n 


Using the induction hypothesis in conjunction with the fact that Kr [X] is an 
integral domain, we get 7(®,) = ®,, as desired. 


Departing from @, = X — 1, the formula of Proposition 11 (iii) can be used 
to compute cyclotomic polynomials in a recursive way. For K = Q, the formula 
provides, in fact, the prime factorization of X" — 1 in Z[X] or in Q[X], since 
then the factors ®, are irreducible by (ii). Let us give some examples. For a 


prime number p, we get 
X?—-1=@,-@, 


and hence 
&, = XP14 XP 384+ +1, 


Furthermore, if p and q are distinct prime numbers, then 
XPT _ 1 =, -,-B,- Py, 


and therefore 
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XP] 


Pog = (fete ee 


For instance, we get 


X®-1 


f= 


Gy = 


Let us list the first 12 cyclotomic polynomials explicitly: 


&,=X-1 
@,=X4+1 
6,= X*4+X4+1 
G,= X* +1 


b= X*4 X94 X74 K41 
6,= X*- X41 
Gy = X94 X94 X44 X84 X72 4X41 
@, = X*+1 
Py = X°4+ X39 41 
eek HR eR aX AL 

nemem Gale’ Gat, Gee Ge dar ee, oat dae Grom Gao, areal 
Og = X*— X71 


Looking at the these examples, one might guess that 1 and —1 are the only 
nonzero coefficients that can occur in cyclotomic polynomials. However, that is 
not the case. In fact, 


Pro = x48 a at 4+ x46 x4 x” 9x4! x40 x39 

ie x36 Bi. x dt x34 oh x33 aft, xXx 4 x31 x8 x6 x74 xX x20 
dye xl 4 x6 4xb aie x 4 xb 4 xP xX? x8 9x7 x6 xX 
+X?4+X+41 


is the first cyclotomic polynomial whose coefficients are not all of absolute value 
< 1. The next three polynomials of this type are ®1g5, ®195, and ®y19, where, 
just as for ®jo5, all their coefficients are of absolute value < 2. However, it is 
known from a result of I. Schur that the coefficients of cyclotomic polynomials 
are unbounded. Indeed, for n = p,-...+ Pm with prime numbers p; <... < Dm 
such that p,, <p, + pe, it turns out that the coefficient of X?™ in @, is 1—m. 
Furthermore, one can show using arguments from number theory that for m odd, 
there always exist prime numbers pj,..., Pm Satisfying the preceding conditions. 

Finally, let us have a special look at finite fields, considering the field F, of ¢ 
elements for a prime power q. Recall from 3.8/6 that the Galois group of a finite 
extension F/F, is cyclic of order [F : Fj], generated by the relative Frobenius 
homomorphism F —> F, a+ > a’. 
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Proposition 12. Let ¢ € F, be a primitive nth root of unity and assume that 
gcd(n, q) = 1, where q is a prime power. 

(i) Look at the injection w: Gal(F,(¢)/F,) @ Aut(U,,) of Proposition 7 (ii) 
and use the identification Aut(U,) ~ (Z/nZ)* of Proposition 7 (iii). The relative 
Frobenius homomorphism of F,(¢)/F, is mapped under w to the residue class 
q € (Z/nZ)* attached to q. In particular, y gives rise to an isomorphism between 
Gal(F,(¢)/F,) and the subgroup (q) C (Z/nZ)*. 

(ii) The degree [F,(¢) : F,] coincides with the order of ¢ in (Z/nZ)*. 

(iii) The nth cyclotomic polynomial ®,, is irreducible in F,[X] if and only if 
q generates the group (Z/nZ)*. 


Proof. The relative Frobenius homomorphism over F, is given on U,, by the map 
¢ —> ¢4 and therefore, using the canonical isomorphism Aut(U,) ~ (Z/nZ)* of 
Proposition 7 (iii), corresponds to the residue class 7 € (Z/nZ)*. Furthermore, 
assertion (ii) is a consequence of (i), due to 


[Fq(¢) : F,| = ord Gal(F,(¢)/F,) = ord(q) = ord@. 
Finally, to verify (iii) observe that ®, is irreducible if and only if the equation 


[F,(¢) : Fy] = deg ®, = y(n) holds. By (ii), that is equivalent to the condition 
that 7 generates the group (Z/nZ)*. 


Therefore, we see that a necessary condition for the nth cyclotomic poly- 
nomial ®,, to be irreducible is that the group (Z/nZ)* be cyclic. For example, 
(Z/nZ)* is cyclic for a prime number n = p, see 3.6/14, or more generally, also 
for a prime power n = p” of a prime number p ¥ 2; cf. Exercise 7 below. 


Exercises 


1. Consider a primitive nth root of unity ¢, as well as the nth cyclotomic polynomial 
®, € K[X] over a field K, where char K{n. Show that ®, decomposes over K 
into :p(n)/s distinct irreducible factors of degree s = [K(¢): K]. 


2. Let Gm € Q be a primitive mth root of unity. Determine all integers n such that 
the nth cyclotomic polynomial @,, is irreducible over Q(Gm). 


3. Show: y(n) =n- Il (1—p7?). 
p|n, p prime 
4. Determine the Galois group of the polynomial X° — 1 € F7[X]. 


5. Let ¢ be a primitive 12th root of unity over Q. Determine all intermediate fields 


of Q(¢)/Q. 


6. Let p be a prime number such that p—1 = [[)_, p, factorizes into distinct prime 
factors p,, and consider a primitive pth root of unity ¢, € Q. Show that Q(gp)/Q 
is a cyclic Galois extension and that Q(¢,)/Q admits precisely 2” intermediate 
fields. 
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7. Let p be an odd prime number. Show that the group (Z/p"Z)* is cyclic for r > 0 
and conclude from this that the p’th cyclotomic field Q(¢,~) is a cyclic Galois ex- 
tension of Q. Hint: Consider the canonical homomorphism (Z/p"Z)* —> (Z/pZ)* 
together with its kernel W and show by induction that the residue class of 1 + p 
is an element of order p"~! in W, hence in particular, that W is cyclic. 


8. Verify the following formulas for cyclotomic polynomials @,,: 
(i) Bp (X) = &,(X?"~*), for p prime, r > 0. 
(ii) @,(X) = Dy, p(XPU PE"), for a prime factorization n = pj’... pis 
with distinct prime factors p, and exponents r, > 0. 
(iii) bo,(X) = &,(—X), for n > 3 odd. 
®,(X?) 
®,(X) 
9. Determine all roots of unity that are contained in the fields Q(V/2), Q(z), Q(iv2), 
and Q(iV3). 


(iv) Ppn(X) = 


, for a prime number p such that pfn. 


4.6 Linear Independence of Characters 


In the next two sections we will discuss some methods from linear algebra that 
are of special interest for applications in Galois theory, in particular for the 
study of cyclic extensions in 4.8. The “linear” point of view in Galois theory 
was suggested by E. Artin, who effectively used it in [1], [2] to develop an 
alternative approach to the theory. In a first step, we will study characters. 
Related to Galois theory, characters will occur in the form of homomorphisms 
k* —+ L* between the multiplicative groups of two fields K and L. The main 
objective of the present section is to show that different characters are linearly 
independent. 


Definition 1. Let G be a group and K a field. A K-valued character of G is 
a group homomorphism x: G — K*. 


For a group G and a field K, there exists always the trivial character 
G —+ K*, mapping every element g € G to the unit element 1 € K™. Further- 
more, the A-valued characters of G form a group, whose law of composition is 
induced from the multiplication on K*. Indeed, the product of two characters 
X1,X2: G —> K™ is given by 


Xa X21 G—> K*, gg ya (9)- X2(9). 


Also note that the A-valued characters of G can be viewed as special elements 
of the K-vector space Map(G, Kk’), consisting of all maps from G to K. In 
particular, it is meaningful to talk about linear dependence or independence of 
characters. 


Proposition 2 (E. Artin). Distinct characters x1,...,Xn on a group G with 
values in a field K are linearly independent in Map(G, K). 
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Proof. We proceed indirectly and assume that the assertion of the proposition 
is false. Then there is a minimal number n € N such that there exists a linearly 
dependent system of K-valued characters x1,...,n on G. Of course, we must 
have n > 2, since every character assumes values in K* and therefore cannot 
coincide with the zero map. Now let 


Xi t...-+ anXn = 0 


be a nontrivial relation in Map(G, A’) with coefficients a; € AK. Then a; 4 0 for 
all 7, due to the minimality of n, and we get 


ayX1(gh) +...+ anXn(gh) =0 


for g,h € G. Choose g with the property that yi(g) 4 x2(g); this is possible, 
since y, # x2. Then, varying h over G, we see that 


a1X1(9) oi ale eataick, AnXn(Q) ‘Xn = 0 


is a new nontrivial relation in Map(G, A’). Multiplying the initial one by y1(g) 
and subtracting the new one from it, we get a third relation: 


az(x1(9) — x2(9)) x2 +... + an(x1(9) — Xn(9)) Xn = O. 


This is a nontrivial relation of length n—1, since a2(x1(g)—x2(g)) 4 0. However, 
this contradicts the minimality of n, and it follows that the assertion of the 
proposition is true. 


The preceding proposition can be applied in various settings. For exam- 
ple, if L/K is an algebraic field extension, we see that the system Aut, (L) of 
all K-automorphisms of L is linearly independent in the L-vector space of all 
maps L —+ L. To justify this, restrict K-homomorphisms L —> L to group 
homomorphisms L* —+ L*. 


Corollary 3. Let L/K be a finite separable field extension and 2,...,2n @ 
basis of L as a K-vector space. Furthermore, let o1,...,0n denote the K-homo- 
morphisms of L to an algebraic closure K of K. Then the vectors 


f= (o1(21), bes ,01(2n)), 


a = (On Di) ney Gna) 


give rise to a system that is linearly independent over K. 


Proof. The linear dependence of the €; would imply the linear dependence of 
the o;. However, as we can read from Proposition 2, the o; form a linearly 
independent system. 
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To give another example, consider characters of type 


LK vera’, 
for fixed a € K*. If there are distinct elements a,,...,a, € K*, as well as 
further elements c,...,Cn € K such that 


cay t+...+c,a;, =0 


for all vy € Z, then Proposition 2 shows that cy =... = ¢, = 0. 


Exercises 


1. Let G be a cyclic group and F a finite field. Determine all F-valued characters of 
G, and in particular, specify their number. 

2. Let L/K and M/K be field extensions and consider distinct K-homomorphisms 
01,---,0r from L to M. Prove the existence of elements 21,...,2, € LD such 
that, similarly as in Corollary 3, the vectors & = (o;(#1),...,0i(a,-)) € M’, 
i=1,...,7r, are linearly independent over M. Hint: Look at the map L —> M’, 
xt—> (oi (x),...,0,(x)), and show that M", as an M-vector space, is generated 
by the image of this map. 

3. Let L/K and M/K be field extensions and oj,...,0, distinct K-homomorphisms 
from L to M. Furthermore, consider a polynomial f € M[X1,...,X;] such that 
f(oi(a),...,0r(@)) = 0 for all « € L. Use Exercise 2 and show that f is the 
zero polynomial if K contains infinitely many elements. Hint: Choose elements 
1,...,%- € L as in Exercise 2 and verify in a first step that the polynomial 
g(M%,---.¥-) = FOR o1(@i)Vi,---, 2-4 07 (vi) Yi) is the zero polynomial. 


4.7 Norm and Trace 


In linear algebra one defines the determinant and the trace for endomorphisms 
of finite-dimensional vector spaces over fields. Since we will use these notions in 
the sequel, we give a brief review of them. Let K be a field, V an n-dimensional 
K-vector space, and y: V —> V an endomorphism. The characteristic polyno- 
mial of y is given by 


Xo(X) = det(X -id—y) = So ex", 
i=0 


where (—1)"c, = det(y) is the determinant and —c, = trace(y) the trace of vy. 
If the matrix A = (a;;) € K”"*” represents y with respect to a certain basis of 
V, then 


det(y~) = det(A) = > BUF Oil ih: == Oasis 


TEGn 


trace(y) = trace(A) = S> ii. 
i=1 
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Furthermore, for two endomorphisms y, wv: V — V and constants a,b € K we 
have 


trace(ay + by) = a- trace(y) + b- trace(w), 
det (vy o w) = det(y) - det(v). 


Definition 1. Let L/K be a finite field extension. For elements a € L, consider 
the multiplication map 


Ya: LL, tt > ax, 
as an endomorphism of L as a K-vector space. Then 
trp/K(a) := trace(Yq), Nz/x(a) := det (Pa) 


are called the trace and the norm of a with respect to the extension L/K. 


In particular, try;~«: L —+ K is a homomorphism of K-vector spaces, or 
in more precise terms, a linear functional on L viewed as a K-vector space. 
Likewise, Ny: L* —+ K™* is a group homomorphism, hence a character on 
L* with values in Kk. For example, we have 


Neyr(z) = [z|?. 


Indeed, if z = 7 +iy is the decomposition of z into its real and imaginary parts, 
then the multiplication by z on C is described relative to the R-basis 1,7 by the 


matrix 
| 
you) 


In the following, let us discuss some methods for computing the trace and the 
norm. 


Lemma 2. Let L/K be a finite field extension of degree n = [L : K], and 
consider an element a € L. 


(i) If ae K, then 
trp/x(a) = na, Nz/x(a) =a". 


(ii) If L = K(a) and X" +X" 14+...4+ cy is the minimal polynomial of 
a over K, then 


trr/x(a) =—c1, Nryx(@) = (-1)"en. 


Proof. For a € K the linear map ya: L —> L is described by a times the unit 
matrix of k"*”. This justifies the formulas in (i). Furthermore, if L = K(a), the 
minimal polynomial of a coincides with the minimal polynomial of the endomor- 
phism y,, and hence by reasons of degree, must coincide with the characteristic 
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polynomial of y,. Therefore, the formulas in (ii) follow from the description of 
trace(y,) and det(y,) in terms of the coefficients of the characteristic polyno- 
mial of gq. 


The two cases of Lemma 2 can be combined, thus showing how to compute 
the norm and the trace of elements when one is dealing with arbitrary field 
extensions. 


Lemma 3. Consider an element a € L of a finite field extension L/K, and let 
s=[L: K(a)]. Then 


trp/K (a) = 8° trK(a)/K (a), Nz/K(a@) = (Nia yK(a a))”. 


Proof. Choose a K-basis 71,...,2, of K(a), as well as a K(a)-basis y,..., Ys 
of L. Then the products x;y; form a K-basis of L. Let A € K"*" be the matrix 
describing the multiplication by a on K(a) relative to the basis 21,...,x,. It 
follows that, relative to the basis consisting of the x;y;, the multiplication by a 
on L is given by the matrix 


A. 0 
C = % ) 
0 A 
which consists of s boxes A and of zeros otherwise. Therefore, we get 
trz/K(a) = trace(C) = s- trace(A) = s- trxay/K(a), 
Ny x(a) = det(C) = (det(A))° = (Nxiayx(@))”, 


as claimed. 


Proposition 4. Let L/K be a finite field extension of degree [L : K] = qr, 
where r = [L: K], is the separable degree of L/K. (Note that q is sometimes 
called the inseparable degree of L/K.) If o1,...,0, are the K-homomorphisms 
of L into an algebraic closure K of K, the following formulas hold for elements 
acél: 


trz/x(@) = =a) ai 
NyjKx(a )= (Ile ))" . 


Furthermore, if the extension L/K is not separable for p = char K > 0, then q 
is a nontrivial power of p, and we get trr/x(a) =0 for alla € L. 


Before proving the proposition, let us state the transitivity formulas for 
the trace and the norm; these will be proved together with the assertion of 
Proposition 4. 
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Proposition 5. Let K C LC M be a chain of finite field extensions. Then: 


truyK = trrsK Ott, Nuk = Nyx ° Nuyjt- 


Proof of Propositions 4 and 5. In the situation of Proposition 4 we write for 
elements a € L 


tre (a )= ayo aio 
Nyx (@) = (To ))" ’ 


and show that trz;~K = tripe as well as Nyx = Nik: To do this, we consider 
the special cases of Lemma 2 and apply the transitivity formulas to settle the 
general case. 

First, assume a € K. Since [LZ : K'] = qr and o;(a) = a for all 7, we conclude 
from Lemma 2 that 


trijx(a) = [L: K]-a= q(ra a) = try x(a), 
ae qed (a) Ni ala). 


Now consider the second special case of Lemma 2 and assume L = K(a). Let 
X™ 4X1 +...+4G, € K[X] 
be the minimal polynomial of a over K’, where n = qr. Using 3.4/8 and 3.6/2, 


the latter polynomial admits the factorization 


: i 


utes —a;(a))" 


i=l 


over K. Therefore, we conclude from Lemma 2 that 
trzjx(a) = -c = 19a ) = try x(a), 


Niyx(a) = »= (TL ai(a)) = Nryxe(a). 


In particular, we thereby see that tr and tr’ as well as N and N’ coincide in the 
special cases of Lemma 2. 

Now, if a € LF is arbitrary, consider the chain of finite field extensions 
K Cc K(a) C L and use Lemmas 2 and 3. Then the special cases we have just 
discussed show that 
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trzjx(a) = [L: K(a)] - trx@yx(a) = trx(a/K (LL : K(a)] -a) 
= trix (trex (a(a)) 
= tree (ttre) (@)); 
[L:K(a)] Kila 
Nix (a) = (Ne@yx (a) = Ne@yx (ale*O!) 
= Nxa/« (Nixa) (a) 
= Nk(a)/K (Ni /x(a) (a)). 
Thus, to prove Proposition 4, it is enough to establish the transitivity formulas 
of Proposition 5 for tr’ and N’. The same formulas are then valid for tr and N 
as well, due to Proposition 4. 
Therefore, consider a chain of finite field extensions K C LC M as in 


Proposition 5. Embedding M into an algebraic closure K of K, we may assume 
that. the chain is contained in K. For 


[L:K]=q[L: K]s, [M:L] =@[M: Ls, 
the multiplicativity formulas 3.2/2 and 3.6/7 imply 
[M: K]=a@[M : K]s. 
Assuming 
Hom, (L, K) = 1 Gi eseye ts Hom,(M, Kk) SAT ices Tp 


where the elements o;, resp. Tj, are distinct, we can apply 3.4/9 and thereby 
choose extensions o/: K —+ K of the o;. It follows that, just as in the the proof 
of 3.6/7, we get 


Homi Ayo, ot = Legh p= gus 8} 


with distinct elements oj o 7;. Then it is easy to derive the desired transitivity 
formulas for elements a € M, since we have 


tryx(@ = 19 20 0,0 T;(a 


=n > of i(% > 7 


a Jj 
= trex (try (a)) ) 


as well as a similar chain of equalities for Ni, jx (a). However, note that the last 
line is meaningful only if we know that tr,,;(a) is an element of L, or what is 
enough, that we have tr,,,(a@) = truyz(a). Going back to the situation faced in 
the proof of Proposition 4 above, the latter equality is indeed given, so that we 
can finish the proof of Proposition 4. After this, the general transitivity formulas 
in Proposition 5 are derived from the corresponding ones for tr’ and N’ using 
the assertion of Proposition 4. 
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There is an immediate consequence of Proposition 4: 


Corollary 6. Let L/K be a finite Galois extension. Then trp; and Nz; are 
compatible with Galois automorphisms of L/K, i.e., we have 


trr/« (a) = trr/x(o(a)), Nzx(a) = Nz (o(a)) 
for alla € L, o € Gal(L/K). 


We want to derive some further consequences from Proposition 4. Given a 
finite field extension L/K, we can view L as a K-vector space and consider the 
symmetric bilinear form 


tr: Lx L— K, (x,y) +> trrK (ay). 
By Proposition 4, this map vanishes identically if L/K fails to be separable. 


Proposition 7. A finite field extension L/K is separable if and only if the 
K-linear map trzjx: L —> K is nontrivial and hence surjective. If L/K is 
separable, the symmetric bilinear map 


tr: Lx L— kK, (x,y) > try/K«(zy), 
is nondegenerate. In other words, tr induces then an isomorphism 
a xt— tr(z,-), 


of L onto its dual space Ly 


Proof. We assume that L/K is separable. If o1,...,0, are the K-homomorphisms 
of Z into an algebraic closure of K, we get 


trysK =0,+...+0, 
by Proposition 4. Furthermore, Proposition 4.6/2 on the linear independence of 


characters shows that try/x is not identically zero. Now consider an element x 


of the kernel of L —> L, i.e., an element satisfying tr(z,-) = 0. Then we get 
trz/«(xL) = 0 and therefore necessarily x = 0, since otherwise, we would have 


vl = L, and try;~ would vanish on L. Hence, the map L —> L is injective, 


and since dim L = dim L < oo, also surjective. 


Corollary 8. Let L/K be a finite separable field extension with a K-basis 
X1,.--,€n of L. Then there exists a unique K-basis y1,...,Yn of L such that 
trz/x(aiy;) = 6; for 1,7 =1,...,n. 


Proof. Use the existence and uniqueness of the dual basis of x1,...,%p. 


Exercises 


1. Let L/K be a field extension of degree n < oo. Describe the properties of the set 
{a © L; trp/K(a) = Of. 
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Let F'/F be an extension of finite fields. Describe the kernel and the image of the 
attached norm map N: F™ —> F*. 


Let K be a field and L = K(a) a simple algebraic field extension with minimal 
polynomial f ¢ K[X] of a. Show f(x) =Nz/«(x—a) for ze K. 


. For relatively prime positive integers m,n, consider a field extension L/K of 


degree m. Then every element a € K admitting an nth root in LD already admits 
an nth root in K. 


Let L/K be a finite Galois extension with K-basis 71,...,2%». Show for a sub- 
group H Cc Gal(L/K) that its corresponding fixed field L? is characterized by 
LY = K (tipper (a1), ++, ttgypa (en). 

Let L/K be a finite field extension in characteristic p > 0. Show for elements 
aé€ L that trp/q(@”) = (tres (@))?. 


4.8 Cyclic Extensions 


In 


order to solve algebraic equations by radicals it is necessary to study ex- 


tensions of a given field K that are obtained by adjoining an nth root of some 
element c € K. The aim of the present section is to characterize such extensions 
in terms of Galois theory. We will base our study on the famous Theorem 90 of 


D. 


Hilbert [9], which we will prove first. Recall that a Galois extension L/K is 


called cyclic if its Galois group Gal(L/K) is cyclic. 


Theorem 1 (Hilbert 90). Let L/K be a finite cyclic Galois extension and let 
a € Gal(L/K) be a generating element. Then the following conditions are equiv- 
alent for elements b € L: 


(i) NzjK(0) = 


(ii) There exists an element a € L* such that b=a-o(a)7t. 


Proof. If b= a-o(a)~' for some a € L*, we conclude from 4.7/6 that 


Nz/K(a) 


rc) 


Conversely, consider an element b € L satisfying N;/«(b) = 1. Let n = LL: Ky]. 
Using the linear independence of characters 4.6/2, it follows that 


o° +bo' +b-a(b)-o? +...+b-a(b)-...-077(b) 0%, 


as amap L* —-+ L, is not identically zero. Therefore, there is an element c € L* 


SUC. 


h that 


a:=c+bo(c)+b-a(b)-o7%(c)+...+b-a(b)-...-0% (6) - a7 *(c) £0. 


Applying o and then multiplying by b, we get 
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b-o(a) = ba(c) +b-o(b)- o°(c) +...+6-0(b)-...-6" "()-0"(c) =a, 


since o” = id and b- 0(b)-...- 0" 1(6) = Nzx(b) = 1, due to 4.7/4. 


The preceding theorem can also be interpreted within the more general 
context of Galois cohomology. We want to briefly explain this; for more de- 
tails one may consult Serre [14], Chaps. VI, X. In the following, consider a 
group G together with an abelian group A, as well as an action of G on A, 
by which is meant a group homomorphism G —> Aut(A). Given a finite (not 
necessarily cyclic) Galois extension L/K, we are mainly interested in the case 
that G = Gal(L/K) and A = L*, with G —> Aut(L*) being the canonical 
homomorphism. For ¢ € G and a € A, write o(a) for the image of a with 
respect to the automorphism of A that is attached to o. Then we can consider 
the following subgroups of Map(G’, A), the abelian group consisting of all maps 
from G to A: 


Z'(G,A) = {fF f(c00') =a(f(o')) - f(e) for all a,c’ € ch, 
B'(G, A) = i there exists a € A such that f(¢) =a-o(a)7! for alla € ch. 


The elements of B!(G, A) are called 1-coboundaries; they constitute a subgroup 
of Z(G, A), the group of 1-cocycles. The residue class group 


H'(G, A) = Z(G, A)/BM(G, A) 


is called the first cohomology group of G with values in A. Using such terminol- 
ogy, the cohomological version of Hilbert’s Theorem 90 reads as follows: 


Theorem 2. [f L/K is a finite Galois extension with Galois group G, then 
H'(G, L*) = {1}, i.e., every 1-cocycle is a 1-coboundary. 


Proof. Let f: G —>+ L* be a 1-cocycle. To show that it is a 1-coboundary, look 


at the Poincaré series 
b= > f(o')-o'(0) 
o’EG 
for elements c € L*. Using the linear independence of characters 4.6/2, we can 
choose c in such a way that b 4 0. Then we see for arbitrary o € G that 


o(b) = S) o(f(o')) -(@2a')(c) 
o'EG 


= SF flo): flee’): (coa')(c) = f(a) *-b, 


o’EG 


i.e., that f is a 1-coboundary. 


To derive Hilbert’s Theorem 90 in its original version from Theorem 2, con- 
sider a cyclic Galois extension L/K of degree n and fix a generating element o of 
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the Galois group Gal(L/K). Then one shows for b € L* satisfying Nz/x(b) = 1 
that f: G — L*, given by 


emi, 
ot > b, 


a" + b- a(b)+...+ 0 7(6), 


is a 1-cocycle, and hence by Theorem 2, a 1-coboundary. 
We want to apply Hilbert’s Theorem 90 in order to characterize cyclic ex- 
tensions in more detail. 


Proposition 3. Let L/K be a field extension and n an integer > 0 such that 
char K {n. Moreover, assume that K contains a primitive nth root of unity. 

(i) Every cyclic Galois extension L/K of degree n is of type L = K(a) for 
an element a € L, whose minimal polynomial over K equals X" — c for some 
element cE K. 

(ii) Conversely, if L = K(a) for some element a € L that is a zero of a 
polynomial of type X" —c € K[X], then L/K is a cyclic Galois extension. 
Furthermore, d = [L : K] divides n and satisfies at € K, which implies that 
X4— a’ € K[X] is the minimal polynomial of a over K. 


Proof. Let ¢ € K be a primitive nth root of unity. If L/K is a cyclic Galois 
extension of degree n, then Nz/x(¢~') = ¢-" = 1 by 4.7/2. Furthermore, using 
Hilbert’s Theorem 90, there exists an element a € L* such that o(a) = Ca, 
where o is a generating element of Gal(Z/K’). Then we get 


a*(a) = Ca, t=0,...,n-1. 
In particular, the elements o°(a),...,0"~1(a) are distinct, and we see that 


[K(a) : K] > n, in fact that L = K(a), since K(a) C L and [L: K] =n.4 
Now observe that 


a(a") =o(a)" = ¢"a" =a", 
i.e., that a” € K. Therefore, a is a zero of the polynomial 
X"—a" Ee K[X]. 


Since a is of degree n over K, this polynomial is already the minimal polynomial 
of a over Kk. This justifies assertion (i). 

Now turning to assertion (ii), assume L = K(a), where a is a zero of a 
polynomial of type X" —c € K[X]. We may assume a ¥ 0, since the case 
a = 0 is trivial. Then €°a,...,¢"~'a are n distinct zeros of X”" — c, and we see 


+ Note that in this way we have constructed a special generating element of the field 
extension L/K. However, that this extension is simple follows just as well from the primitive 
element theorem 3.6/12. 
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that L = K(a) is a splitting field over K of this polynomial. Since X" — c is 
a separable polynomial, due to char K'{n, it follows that L/K is even a Galois 
extension. Now, just as a is a zero of X”" —c, the same is true for o(a) for every 
oa € Gal(L/K). Therefore, we can associate to a in each case an nth root of 
unity w, € U, such that o(a) = wea. It follows that 


Gal(L/K) — U), Thr We, 


is an injective group homomorphism, and hence due to the theorem of La- 
grange 1.2/3 that d := [ZL : K] = ord(Gal(L/K)) divides n = ordU,,. Since 
U,, is cyclic by 4.5/1, every subgroup of U,, admits this property as well. In 
particular, Gal(L/I) is cyclic. If o € Gal(Z/K) generates this cyclic group of 
order d, then w, is a primitive dth root of unity, and we have 


so that a4 € K. Clearly, a is a zero of X4 — a? € K[X], and we see by reasons 
of degree that this is the minimal polynomial of a € L over K. 


Next we want to prove an additive version of Hilbert’s Theorem 90. Also in 
this case there is a generalization in terms of Galois cohomology; cf. Exercise 5 
below. 


Theorem 4 (Hilbert 90, additive version). Let L/K be a finite cyclic Galois 
extension and o € Gal(L/K) a generating element. The following conditions 
are equivalent for elements b © L: 


(i) trr/K (0) = 0. 
(ii) There exists an element a € L such that b =a — (a). 


Proof. We proceed similarly as in the proof of Theorem 1. If b = a — o(a) for 
some element a € L, then 


trr/K(b) = trp/K(a) = trr/x (o(a)) = 0 


by 4.7/6. Conversely, consider an element b € L such that trz/q(b) = 0, and let 
n= ([L: K]. Since the trace map trz/x is not identically zero by 4.7/7, there 
exists an element c € L such that trz/«(c) #0. Define a € L by 


a- (trzs«(c)) =b-o(c) + (b+0(b)) -o(c) +... 
+ (b+ o(b) +...+0"7(b)) -o" 1c). 


Applying o yields 


a(a) - (trz/K(c)) = o(b)o?(c) + (o(b) +.07(b)) -oF(c) +... 


Then, using the relations 


198 4. Galois Theory 


see 4.7/4, we get 
(a — o(a)) - trzyx(c) = bo(c) + b07(c) +... + ee) 
— (o(8) +07(0)+...4077 *@) ¢ 
. (a(c) + o°(c ie Na) ae 


and hence b = a — a(a). 


We want to apply the additive version of Hilbert’s Theorem 90 in order to 
study cyclic extensions of degree p for p = char K > 0, a case that is not covered 
by Proposition 3. 


Theorem 5 (Artin-Schreier). Let L/K be a field extension in characteristic 
p> 0. 

(i) Every cyclic Galois extension L/K of degree p is of type L = K(a) for 
an element a € L whose minimal polynomial over K equals X? — X — c for 
some ce K. 

(ii) Conversely, if L = K(a) for some element a € L that is a zero of a 
polynomial of type X° —X —c € K(X], then L/K is a cyclic Galois extension. 
Furthermore, if the polynomial X? — X —c does not split completely into linear 
factors over K, it is irreducible. In this case, L/K is a cyclic Galois extension 
of degree p. 


Proof. Assume first that L/K is a cyclic Galois extension of degree p. Then 
trz/«(c) = 0 for all c € K by 4.7/2. In particular, using the additive version of 
Hilbert’s Theorem 90, there is an element a € L satisfying o(a) — a = 1, where 
is a generating element of Gal(L/k). Therefore, we get 


o'(a) =at+i, i=0,...,p—1. 


Since the elements o°(a),...,0?~'(a) are distinct, we conclude that the degree 
of a over K is at least p, and hence that L = K(a). Furthermore, we obtain 


a(a? — a) =o(a)? — o(a) = (a+ 1)? -—(a+1) =a? -a 


and thereby see that c := a?—a € K. In particular, a is a zero of the polynomial 
X? — X —c € K[X]. By reasons of degree, this is the minimal polynomial of a 
over Kx. 

Conversely, let us assume L = K(a), where a is a zero of a polynomial of 
type f = X?—-X —c€ K[X]. Since ais a zero of f, the same is true for a+1, 
and we see that 
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a,at+1,...,a+p—-1eL 


are the p distinct zeros of f. In particular, if one of the zeros of f is contained 
in K, then all of them belong to K, and f splits completely into linear factors 
over kK. The same argument shows that L is a splitting field of the separable 
polynomial f over K and hence that the extension L/K is Galois. For L = K, 
the extension is cyclic for trivial reasons. Therefore, assume that f does not 
admit a zero in K. We claim that then f is irreducible over K. Indeed, if such 
is not the case, then there exists a factorization f = gh into nonconstant monic 
polynomials g and h. Over L we have the factorization 


and it follows that g is a product of some of these factors. Let d = deg g. The 
coefficient of X¢' in g is of type —da + j for some element j belonging to 
the prime subfield F, Cc K. However, from —da+j € K and p{d we get 
a € K, so that f would have a zero in K. Since this was excluded, f must be 
irreducible over K’. Now use 3.4/8 and choose an element o € Gal(L/K) such 
that o(a) =a+1. Then a is of order > p, and since ord Gal(L/K’) = deg f = p, 
we see that L/K is a cyclic Galois extension of degree p. 


Exercises 


1. In the situation of Theorem 1, fir b € L* and consider elements a € L* sat- 
isfying b = a-a(a)~'. Is there a uniqueness assertion? Furthermore, study the 
corresponding question in the situation of Theorem 4. 


2. Illustrate the assertion of Hilbert’s Theorem 90 for the extension C/R. 


3. Let L be a splitting field of a polynomial of type X” — a over a field K such that 
char K {n. Check whether the extension L/K is always cyclic. Also discuss the 
case K = Q. 


4. Show for a finite Galois extension L/K with Galois group G = Gal(L/K) that 
H}(G,GL(n, L)) = {1}. Hint: Although this is not really necessary, assume that 
K admits infinitely many elements. Then proceed as in Theorem 2 and use Exer- 
cise 3 from Section 4.6. Furthermore, observe for the definition of H!(G, GL(n, L)) 
that a priori, this object has to be viewed as a “cohomology set,” since the group 
GL(n, L) is not abelian for n > 1. Therefore, it is not clear from the outset that 
the corresponding group of 1-coboundaries constitutes a normal subgroup of the 
group of 1-cocycles and hence that the set of residue classes forms a group. 


5. Consider a finite Galois extension L/K with Galois group G = Gal(L/K) 
and equip the additive group LD with the canonical action of G. Show that 
H'(G,L) =0. Hint: Adapt the proof of Theorem 2 to the additive point of view. 


6. Use Hilbert’s Theorem 90 and show for two rational numbers a,b € Q that the 
relation a? + b? = 1 is equivalent to the existence of integers m,n € Z satisfying 


m2 — n? 2mn 


~ m2 +n?’ ~ m2 +n? 
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4.9 Multiplicative Kummer Theory* 


Recall that a Galois extension L/K is called abelian if the corresponding Galois 
group G = Gal(L/K) is abelian. More specifically, it is said to be abelian of 
exponent d for an integer d > O if in addition, G is of exponent d, i-e., if 
o? = 1 for every o € G and d is minimal with this property. In the following we 
want to generalize cyclic extensions by studying abelian extensions of exponents 
that divide a given number n € N — {0}. Such extensions are referred to as 
Kummer extensions, named after E. Kummer, who considered these extensions 
for number-theoretic reasons.° 

In the present section, we assume char K {n and furthermore that K contains 
the group U,, of all nth roots of unity. Given c € K, we write K(c!/") for the 
extension that is obtained by adjoining an nth root of c to K. However, observe 
that cl/", as an element of an algebraic closure of K’,, is unique only up to an 
nth root of unity, while the field K(c!/") itself is well defined. Indeed, it is 
the splitting field of the polynomial X” — c, since K is supposed to contain 
all nth roots of unity. Furthermore, it follows from 4.8/3 that K(c!/")/K is a 
cyclic extension of a degree dividing n. Similarly, fixing a subset C C K, we 
can define the Galois extension K(C'/") that is obtained from K by adjoining 
all nth roots cl” of elements c € C. The resulting field can be viewed as the 
composite field (in an algebraic closure of K’) of all extensions K(c!/"), where 
c varies over C. In particular, we can consider the restriction homomorphisms 
Gal(K(C™")/K) —> Gal(K(c!/")/K) of 4.1/2, giving rise to a monomorphism 


Gal(K(C'")/K) — [[ Gal(K(c"")/K). 
ceC 


Thereby we recognize K(C'/")/K as a (not necessarily finite) abelian extension 
of some exponent dividing n. This fact will be re-proved in Proposition 1 (i) be- 
low in a direct way, without referring to the characterization of cyclic extensions 
as given in 4.8/3. 

Let us write Go for the Galois group of the extension K(C'/")/K. Given 
o € Go and an nth root c!/” of some element c € C, it follows that o(c!/”) is 
also an nth root of c. Therefore, there is an nth root of unity w, € U, such 
that o(c!/") = w,cl!/”. As is easily checked, w, = a(cl/") -c~/" is independent 
of the choice of the nth root c!/” of c. Therefore, we get a well-defined pairing 


cl/n 
(,:): Go x C —> Un, (a,qr4 29 
C n 
In the sequel we will assume that C is a subgroup of K*. Then (-,-) is bimul- 
tiplicative in the sense that 


5 Strictly speaking, an abelian extension L/K of exponent d for some d > 0 is called a 
Kummer extension if char K{d and K contains the group Ug of all dth roots of unity. 
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ao ot(cl/”) aort(cl") r(cl/”) 

OE) ag = splay oo gig > Ne ATs 
a(cl/ncl/n) a(cl/”) o(c/) 

(ye c) = cl/ngi/n = cain iijn = (9,¢) + (0,¢), 


for o,t € Go and c,c’ € C. Furthermore, we get (a,c”) = 1 for o € Go and 
c € K*. Therefore, if we assume that C C A™ is a subgroup containing the 
group K*" of all nth powers of elements in K™, it follows that (-,-) gives rise 
to a bimultiplicative map 


a(cl/) 


Go x C/K*™ — U,, (g,@) > 7 
C n 


which will be denoted by (-,-) again. 


Proposition 1. As before, consider a field K and an integer n > 0 such that 
char K{n and U;, Cc K*. Furthermore, let C C K* be a subgroup containing 
K*", Then: 

(i) The extension K(C")/K is Galois and abelian of some exponent di- 
viding n. Let Go be the corresponding Galois group. 

(ii) The bimultiplicative map 


a(cl”) 
cl/n 


(,-): Go x C/K*" —3 U,, (g,@) > 


? 
is nondegenerate in the sense that it gives rise to monomorphisms 


Yr: Go — Hom(C/K*", Un), ar (a,°), 
po: Cin —> Hom(Ge, Un), EH> (-,0), 


into the group of all homomorphisms C/K*" —+ U,, resp. of all homomor- 
phisms Go —> U,. More precisely, 1 is an isomorphism, while (2 restricts to 
an isomorphism C/K*" > Homeont(Gc, Un) onto the group of all continuous 
homomorphisms Go —> U,,.® 

(iii) The extension K(C'")/K is finite if and only if the index (C : K*") is 
finite. If such is the case, both maps y, and yp of (ii) are isomorphisms, and 
one obtains [K(C¥") : K] = (C: K*"). 


Proof. Assertion (i) follows from the injectivity of y; in (ii). To show that vy 
is injective, consider an element o € Ge such that o(c!/”) = cl/” for all c € C. 
Then o(a) = a for all a € K(C'/"), and we get o = id. Hence, ¢ is injective. 
On the other hand, look at an element c € C satisfying o(c!/") = cl/” for all 


© Within this context, consider Gc as a topological group as explained in Section 4.2, 
and equip U,, with the discrete topology. In this way, a homomorphism f: Go —> U, is 
continuous if and only if H = kerf is an open subgroup in Go, ie., according to 4.2/3 
and 4.2/5, if and only if there exists a finite Galois extension K’/K in K(C'/") such that 
H = Gal(K(C'")/K’), or what is enough, such that H > Gal(K(C'/")/K’). 
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o € Gc. Then cl!” € K and therefore c € K*", and it follows that also is 
injective. This settles the nondegenerateness of the pairing in (ii). 

Next we establish assertion (iii), relying on the injectivity of y,; and yp in 
(ii). If [K(C™") : K’] is finite, then the same is true for Gc, and we can conclude 
that Hom(Gc, U,) is finite. Due to the injectivity of (a, it follows that C/K*" is 
finite. Conversely, if C/K*” is finite, then the same is true for Hom(C/K*", U,,), 
and due to the injectivity of v1, also for Go and [K(C") : K]. Furthermore, 
if the finiteness is given, there exist (noncanonical) isomorphisms 


C/K*" — Hom(C/K*™, U,), Go — Hom(Go, U,), 
as we will show in Lemma 2 below. Therefore, the estimate 


[K(C'") : K] =ordGe < ord Hom(C/K*", U,,) = ord C/K*” 
< ord Hom(Ge, Un) = ord Ge = [K(C'/") : K] 


yields the desired equality [K(C/") : K] = (C': K*") and in addition shows 
that Y1, 2 are isomorphisms. Thus, we are done with the proof of Proposition 1 
in the special case in which [K(C'/") : K] and (C : K*") are finite. 

In the nonfinite case consider the system (C;);cr of all subgroups of C’ such 
that C; > K*" and (C; : K*") < oo. Then we get C = Uier Ci, as well 
ae OG) = er K(C1/"), viewing these fields as subfields of an algebraic 
closure of kK. For every 7 € I there is a commutative diagram 


Go — Hom(C/K*", Un) 


| | 


Go, —S Hom(C,/K**, U,) 


p] 


where the vertical map on the left is restriction of Galois automorphisms of 
K(C")/K to automorphisms of K(C}/")/K (see 4.1/2), and where the ver- 
tical map on the right is restriction of homomorphisms C/K*" —+ U,, to 
C,/K*". As we have seen, all maps Yi, are bijective. Therefore, given a ho- 
momorphism f: C/A*" —+ U,, there exist unique elements 0; € Go, such 
that y1;(o;) = Flespickn for all 2, and it is easily checked that the o; make up a 
Galois automorphism o € Gg satisfying y;(c) = f. Thereby it follows that y, 
is surjective and hence bijective. 

To obtain the stated assertion on ¢, consider for all i € J the commutative 
diagram 

O,/K** + Hom(Go,, Up) 


| | 


cik* =~» Homi Ge, U,); 


where the vertical map on the left is the canonical inclusion, and where the 
vertical map on the right is induced by the restriction map Gc —> Gc,, which 
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was considered before. Since every continuous homomorphism f: Go —> U, is 
induced from a homomorphism of type f;: Gc, —> U,, the assertion on 9 in 
(ii) follows from the bijectivity of yo. 


It remains to establish the duality isomorphisms that were needed in the 
proof above. Note that U,, is cyclic of order n due to 4.5/1 and hence that it is 
isomorphic to Z/nZ. 


Lemma 2. For an integer n € N — {0}, consider a finite abelian group H 
of some exponent dividing n. Then there exists a (noncanonical) isomorphism 


H —™ Hom(H,Z/nZ). 


Proof. Since Hom(-, Z/nZ) is compatible with finite direct sums, we can apply 
the fundamental theorem of finitely generated abelian groups 2.9/9 and thereby 
assume that H is cyclic of some order d, where d|n. Then we have to construct 
an isomorphism 


Z/dZ > Hom(Z/dZ, Z/nZ). 


To do this, let us reduce to the case d = n. We know for every divi- 
sor d of n from the solution of Exercise 2 in Section 1.3 that there exists 
a unique subgroup Hy C Z/nZ of order d and that it is cyclic. Clearly, 
we have Hy C Hy for d’ | d, and it follows that every homomorphism 
Z/dZ —+ Z/nZ factors through Hq. In particular, we see that the canon- 
ical map Hom(Z/dZ, Hz) —> Hom(Z/dZ,Z/nZ) is an isomorphism. There- 
fore, using the fact that Hy ~ Z/dZ, it is enough to specify an isomorphism 
Z/dZ —+> Hom(Z/dZ, Z/dZ) in order to settle the assertion of the lemma. But 
that is easy, since 


Z—+ Hom(Z/dZ,Z/dZ), 1+ id, 


is an epimorphism with kernel dZ and therefore induces an isomorphism, as 
desired. 


Theorem 3. Let K be a field and n > 0 an integer such that char K{n as well 
as U, C K*. The maps 


subgroups C Cc K* ——72"_... abelian extensions L/K 
such that K*" c C of exponents dividing n 


CG y+——. FO, 


Pak «——- i, 


are inclusion-preserving, bijective, and mutually inverse to each other.’ Further- 
more, the Galois group Go of an extension K(C™")/K is characterized by the 
isomorphism 


T Tn order to be able to talk about the set of all abelian extensions of K, we consider these 
extensions as subfields of a chosen algebraic closure K of K. 
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yi: Gg Hom(C/K*", Us). ar (g,°), 


of Proposition 1 (ii). If C/K*" is finite, then Hom(C/K*", U,) and hence also 
Go are (noncanonically) isomorphic to C/K*". 


Proof. Due to Proposition 1 and Lemma 2, it remains only to show that the maps 
@, W are bijective and mutually inverse to each other. Starting with the relation 
Wo ® = id, we consider a subgroup C C K™ satisfying C > K*", where in 
a first step, we assume (C': K*") < oo. Then C’ = (K(C'"))" 19 K® satisfies 
C c C’, and furthermore K(C'’") = K(C"/"). Applying Proposition 1 (iii) 
yields C = C’, as desired. 

If the index (C' : K*") is not necessarily finite, we can apply the preceding 
argument to all subgroups C; C C that are of finite index over K*". Since 
C is the union of these subgroups and since we have K(C'/") = U, Rice, 
it follows also in the general case that C = (K(C'"))" 9 K* and therefore 
Wo =id. 

To justify the remaining relation @oW = id, we consider an abelian extension 
L/K of an exponent dividing n. Writing C = L"m K*, we get K(C") C L 
and must show that these fields coincide. Since we can write L as a union of 
finite Galois and hence abelian extensions, we may assume that L/K is finite. 
Now look at the epimorphism 


q: Gal(L/K) — Go, Tt— Ol«(ci/n); 
which exists by 4.1/2. It is enough to show that the associated homomorphism 
qd: Hom(Gc, U;,) as Hom(Gal(L/K), Cals froofod 


is an isomorphism. Indeed, if such is the case, then the Galois groups of 
K(C'/")/K and L/K will have the same order due to Lemma 2, and we can 
conclude that [L : K] = [K(C'/") : K], as well as L = K(C/”). 

Of course, q* is injective, since q is surjective. To see that q* is surjective 
as well, consider a homomorphism g: Gal(L/) —> U,. Since it satisfies 


g(a0.0') = g(a) -g(0') =o 0g(0') - g(a) 


for o,0' € Gal(L/K), it is a 1-cocycle, in the terminology of Section 4.8. Then 
we can conclude from 4.8/2 that g is already a 1-coboundary, i.e., that there 
exists an element a € L*™ such that g(o) = a-o(a)! for all o € Gal(L/K). 
Since g(a)” = 1 and therefore o(a") = o(a)” = a” for all o € Gal(L/K), we 
conclude that a” € C = L"Q K* and hence that a € K(C'/"). Finally, look at 
the homomorphism 


f:Go— U,, or +a-a(a)", 


and observe that it satisfies g = fog = q*(f). In particular, q* is surjective. 


For an abelian extension L/K of some exponent dividing n, as dealt with in 
Theorem 3, we can easily specify a K-basis of L as follows. Write C = L"n K* 
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and consider a system (c;);er of elements in C’ giving rise to a system of repre- 
sentatives of C/K*". Then, choosing arbitrary nth roots of the c;, the system 
(c; / "Vier is a K-basis of L/K. Indeed, it is clearly a generating system of L/K. 
Moreover, if [L : K] < ov, then it consists of precisely (C : K*") = [L: K] 
elements and therefore is linearly independent as well. In the general case we 
can exhaust L by finite abelian extensions of K to see that (c;/”)ie; is linearly 


independent and thus is a K-basis of L. 


Exercises 


Let K be a field, K an algebraic closure of K, and n > 0 an integer such that 
char A {n and K contains a primitive nth root of unity. 


1. Deduce the characterization of cyclic extensions of K, as given in 4.8/3, from 
Kummer theory. 


2. Consider in K all abelian extensions L/K of exponents dividing n and show that 
there is a largest extension Ly/K among these. Give a characterization of the 
corresponding Galois group Gal(L,/K). 

3. Set K = Q and n = 2 in Exercise 2. Show that Ly = Q(i, V2, V3, V5,...) and 
determine the Galois group of the extension L2/Q. 


4. For c,c’ € K*, consider the splitting fields L, L’ C K of the polynomials X” — c 
and X”" — c’ over K. Show that L = L’ is equivalent to the fact that there exists 
an integer r € N that is relatively prime to n and satisfies c”- cl € K*". 


5. Show for every finite Galois extension L/K that there is a canonical isomorphism 
of groups 
(L"N K*)/K*" + Hom(Gal(L/K), Un). 
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In the preceding section we developed Kummer theory for a field K and an 
exponent n, where char A’ {n. In a similar way, one can study Kummer theory 
for a field K of characteristic p > 0 and p as exponent. The resulting theory 
is referred to as Artin—Schreier theory. More generally, for p = char kK > 0, 
there is a Kummer theory for exponents of type p’, where r > 1, that goes 
back to E. Witt. All these Kummer theories are based on a common skeleton 
and are, so to speak, special cases of a general Kummer theory, which is the 
subject of the present section. Let K,, be a separable algebraic closure of kK. For 
example, choose an algebraic closure of K and consider its subfield consisting of 
all elements that are separable over kK’, where for the moment, we do not require 
any restrictions on the characteristic of kK. Then K,// is a Galois extension. 
The corresponding Galois group G = Gal(K,/) is referred to as the absolute 
Galois group of K. It is viewed as a topological group in the sense of Section 4.2. 


Kummer theory. — As main ingredient, we need for a Kummer theory 
over K a continuous G-module A, where G is the absolute Galois group of k. By 
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a G-module we mean an abelian group A equipped with the discrete topology, 
together with a continuous G-action 


Gx A—A, (g,a) -+ o(a), 


respecting the group law on A; for the definition of a group action one may 
consult 5.1/1 and 5.1/2. In particular, we can view such an action as the ho- 
momorphism G —+> Aut(A) mapping an element o € G to the automorphism 
a(-) of A. Also note that the continuity of the action means for every element 
a € A that the subgroup 


G(A/a) = {0 € G; o(a) =a} 


is open in G. According to 4.2/5, this is equivalent to the fact that G(A/a) is 
closed in G and the fixed field Ks'4/ is finite over K. 

As is known from the fundamental theorem of Galois theory 4.2/3, the inter- 
mediate fields of K,/K correspond bijectively to the closed subgroups of G via 
the mapping L +> Gal(K,/L). Therefore, we can associate to an intermediate 
field L of K,/K, resp. to a closed subgroup Gal(K,/L) C G, the fixed group 


A, = {a€ A; o(a) =a for all o € Gal(K,/L)}. 


If L is Galois over K, or equivalently, if Gal(,/L) is a normal subgroup in G, 
it is easily seen that the G-action on A restricts to a G-action on Ay. Thereby 
we get an action of G/ Gal(K,/L) on Az, where due to 4.1/7, this quotient can 
be identified with Gal(L/k). Thus, given a Galois extension L/K, the G-action 
on A gives rise to an action of the corresponding Galois group Gal(L/K) on 
A,, and we can define the cohomology group H'(Gal(L/K), Ar) in the same 
way as we did in Section 4.8. An essential prerequisite of any kind of Kummer 
theory of a given exponent n is the condition that the cohomological version of 
Hilbert’s Theorem 90 be valid for cyclic extensions, in the following form: 


(Hilbert 90) Let L/K be a cyclic Galois extension of a degree dividing n. 
Then H'(Gal(L/K), Ay) = 0. 


Of course, this condition is not automatically fulfilled; it serves, so to speak, as 
an axiom on which Kummer theory is based. 

Having associated to an intermediate field L of K,/K the fixed group Az, 
there is another construction that goes in the reverse direction. Indeed, for a 
subset A Cc A, consider the subgroup 


G(A/A) = {0 € G; o(a) =a for alla € A} 


of G. It is closed in G, since G(A/A) = (.),<4 G(A/a), where all groups G(A/a) 
are open and hence closed in G, due to the continuity of A as a G-module. 
Therefore, G(A/A) can be interpreted as the absolute Galois group of a well- 
defined intermediate field K(A) of K,/K, namely of 
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K(A) = KE@/4) = fa € K,; (a) =a for allo € G(A/A)}. 


Kummer theory of a given exponent n relies, furthermore, on the choice 
of a surjective G-homomorphism ¢: A —> A whose kernel, denoted by Lp, 
in the following, is a cyclic group of order n satisfying u, C Ax. Clearly, a 
G-homomorphism @: A —> A is meant as a homomorphism that is compatible 
with the G-action in the sense that o((a)) = g(a(a)) for alla € Gandaeé A. 

In Section 4.9 we considered the multiplicative group A = K* under the 
natural action of the Galois group G and with 9: A —> A,a+> a", asa 
G-homomorphism, assuming char K {n. This implies that jz, = ker g coincides 
with the group U,, of nth roots of unity and therefore is cyclic of order n. Using 
the notation above, we have Ay, = L* for intermediate fields L of K,/K, as well 
as K(g~!(C)) = K(C") for C C K*. Also note that we assumed U, C K* in 
Section 4.9 and thereby pz, C Ax. Finally, in 4.9/3 we derived a characterization 
of abelian extensions of exponents dividing n, in the style of the fundamental 
theorem of Galois theory, in fact, in terms of subgroups C C Ax containing 
(Ax). The proof required Hilbert’s Theorem 90 in the version of 4.8/2. 

Now we want to show that the results 4.9/1 and 4.9/3 can be extended to 
the context of general Kummer theory. To do this, consider a subset C' C Ax, as 
well as the subgroup G(A/g~1(C)) C G consisting of all o € G that are trivial 
on g 1(C). Let K(o1(C)) be the corresponding intermediate field of K,/K. 
Next, writing the group law on A additively, we see for a € G and a € 9 !(C) 
that 


goo(a)=a0 (a) = 9a), Le., a(a)-—a€ kerg = py. 


Therefore, every o € G restricts to a bijection g-1(C) —>+ o1(C), and it 
follows that G(A/@~1(C)) is a normal subgroup in G. Then K(g7'(C))/K is 
a Galois extension, due to 4.2/3, resp. 4.1/7, and even an abelian extension, as 
we will see further below. Let Gc be the corresponding Galois group. By 4.1/7, 
it can be identified with the quotient G/G(A/@~'(C)). 

For c € C and a € @'(c), the difference o(a) — a € fy, will in general 
depend on c, but not on the choice of a special preimage a € o~1(c). Indeed, if 
we consider another preimage a’ € o~'(c), say a’ = a+i for some i € ker 9 = [Un, 
then 


a(a’) — a’ = (o(a) + o(t)) — (a+) =0(a) -a. 


Therefore, the map 
(,°): Go x C — Un, (a,c) +> o(a) —a, where a € @ ‘(c), 


is well defined. Restricting to subgroups C C Ax such that o(Ax) C C, we 
obtain similarly as in Section 4.9 a pairing 


(,-): Go x C/p(Axk) — bn, (,€) +> o(a) —a, where a € @ ‘(c), 


that is homomorphic in both variables. 
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Theorem 1. Let K be a field and G its absolute Galois group. Consider a con- 
tinuous G-module A together with a surjective G-homomorphism go: A — A, 
whose kernel [tn is a finite cyclic subgroup in Ax of order n. Assume for every 
cyclic Galois extension L/K of degree dividing n that H'(Gal(L/K), A;) = 0. 
Then: 

(i) Viewing the abelian extensions of K as subfields of K,, the maps 


subgroups CC Ax & abelian extensions L/Ix 
such that (An) C C of exponents dividing n J’ 


C nw. K(g\(C)), 
e(Ar) ANAK | L, 


are inclusion-preserving, bijective, and mutually inverse to each other. 
(ii) For subgroups C C Ax such that o(Ax) C C, the bihomomorphic map 


(-,:): Go x C/p(Ak) — Un, (o,f) + o(a) —a4, where a € go ‘(c), 
is nondegenerate in the sense that it gives rise to monomorphisms 


Pi: Go —> Hom(C/(Ax), Hn), at (o,:), 
Yo: C/p(Ax) — Hom(Ge, fn), cr> pes 


More precisely, yp, 1s an isomorphism, and (pg restricts to an isomorphism 
C/p(Ax) > Homoont (Ge, tn) onto the group of all continuous homomor- 
phisms Go — [ln. 

(iii) The extension K(o~'(C))/K is finite if and only if (C : g(Axg)), the 
index of (Ax) in C, is finite. If such is the case, both maps ~, and ~ of (ii) 
are isomorphisms, and one obtains [K(g~'(C)) : K] = (C:: e(Ax)). 


Proof. Similarly as in the proof of 4.9/1, we start by showing that y, and 
are injective. To do this, consider an element o € Gc such that (o,¢) = 0 for 
all c € C. Then we see that o(a) = a for all a € @ 1(C), and if we choose a 
representative o’ € G of a, that o/(a) = a for all a € yg '(C). However, this 
implies 0’ € G(A/g~!(C)) and hence that ¢ is trivial. Therefore, y; is injective. 
On the other hand, consider an element c € C' such that (o,@) = 0 for allo € Go, 
ie., such that o(a) — a = 0 for all o € Go and for preimages a € og '(c). 
Then every such a is invariant under Go, resp. G, so that a € Ax and hence 
c= (a) € p(Ax). Thus, 2 is injective. As a by-product, the injectivity of yy 
shows that the map @ in (i) is well defined. Indeed, for a subgroup C C Ax 
satisfying C D> (Ax), we see that the extension K(o@~1(C))/K is abelian of 
some exponent dividing n. 

Next, turning to the assertions of (iii), observe that the exponents of Gc 
and C/@(Ax) divide n, due to the injectivity of y, and y2. Therefore, we can 
proceed in literally the same way as in the proof of 4.9/1 (iii), including the 
application of the auxiliary Lemma 4.9/2. Furthermore, to derive from (iii) the 
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isomorphism properties of y~, and gy, stated in (ii), we consider the system 
(C;)ier of all subgroups in C' that are of finite index over (Ax). Then 


C=S0G,  G(A/e(C)) =()G(4/e1C)), 


iel ier 


and therefore 


G(K,/K(9-(C))) = () G(K,/K(o(Ci))), 


so that we can interpret K(@~'(C)) as the composite field of the subfields 
K(g~'(C;)). Now observe that the system (C;);er is directed, i.e., for i,j € I 
there is always an index k € I such that C;,C; C C,. Hence, we can even write 
K(p7'(C)) = User K(@7 (Gi). As a result, the argument given in the proof 
of 4.9/1 (ii) carries over to the present situation and yields the isomorphism 
properties stated in (ii) for y, and go. 

Also concerning assertion (i), we base our argument on the corresponding 
approach given in Section 4.9, notably in the proof of 4.9/3. Starting with the 
equation Wo ® = id, consider a subgroup C C Ax such that C D (Ax), 
and let L = K(g~'(C)). We have to show that C’ = o(A,) N Ax coincides 
with C. By its definition, Az C A is the fixed group of G(A/@~'(C)), so that 
we get o '(C) C Az and therefore C C o(Ar)N Ax = C’. Moreover, since 
G(A/Az) = G(A/@7'(C)), we conclude that 


L=K(p(C)) C K(p(C’)) ¢ K(Ar) = L, 


and therefore L = K(g~'(C)) = K(g7!(C’)). Now if C is of finite index over 
(Ax), we obtain C = C” directly from (iii). Otherwise, consider again the 
directed system (C;);¢7 of all subgroups in C that are of finite index over o(Ax). 
As we have seen, the system of all fields L; = K(o~'(C;)) is directed as well, 
and we get L = Uj<; Li. 

Moreover, we claim that 


(x) A, =|JAu.- 


tel 


Of course, we have Ay D U;<; Az,. To derive the reverse inclusion, consider an 
element a € A;, as well as the corresponding subgroup G(A/a) C G leaving 
a fixed. This subgroup is open in G, since the action of G on A is continuous. 
Using 4.2/5, we see that G(A/a) corresponds to an intermediate field E of K,/K 
that is finite over K. We have even EC L, since G(A/a) > G(A/A_z), where 
the group G(A/Az) coincides with G(A/e7!(C)). Since the system (Li)icr is 
directed, there is an index j € J such that EC L;. In particular, this implies 


aéAgC Ap, C (JA, 


ier 


and thereby the equation (*) above. 
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Now observe that 9(Az,) Ax = C; for all 7, since the indices (C; : @(Ax)) 
are finite. Using (*), this implies g(A;) MN Ag = C and hence W o @ = id. 

To justify the relation ® o W = id, consider an abelian extension L/K of 
some exponent dividing n. Writing C = g(Az)M Ag, we get @ 1(C) C Ar. 
In particular, @~1(C) is fixed by Gal(K,/L), which implies K(@~1(C)) c L, 
and we have to show that this inclusion is, in fact, an equality. To achieve 
this, interpret L as the composite field of finite and necessarily abelian Galois 
extensions L’/K. Each of these extensions L’/IX can be written as the composite 
field of finitely many cyclic extensions. To justify this, it is enough to specify 
subgroups H,; of the Galois group H = Gal(L’/K) such that in each case, 
H/H,; is cyclic and such that (), H; = {1}. Applying the fundamental theorem 
of finitely generated abelian groups 2.9/9 shows that this is indeed possible. 
Therefore, L is the composite field of a family (L;)ier of finite cyclic extensions, 
and it is clearly enough to show that L; C K(g~'(C;)) for C; = e(Az,) N Ak. 
In other words, we may assume L/K to be a finite cyclic extension of some 
exponent dividing n. 

Therefore, let L// be such an extension and consider for C = g(Az)N Ax 
the epimorphism 


q: Gal(L/K) — Go, Tt OlK(p-1(C))s 
as well as the corresponding homomorphism 
ie Hom (Ge, fn) — Hom(Gal(L/K), Ln) frofog. 


It is enough to show that g* is an isomorphism, since 4.9/2 then implies 
ord Gal(L/K) = ord Go and hence L = K(g~!(C)). 

Of course g* is injective, since g is surjective. To see that q* is surjective, 
fix a homomorphism g: Gal(L/K) —> py. Then the relation 


g(ooa') =g(c)+g(0') =c0gG(0') + g(o), g,o' € Gal(L/K), 


shows that g is a 1-cocycle with respect to the action of Gal(L/K) on Az. Hence, 
it is a 1-coboundary as well, due to our assumption on H'(Gal(L/K), Ar). Thus, 
there is an element a € Az such that g(0) = a—o(a) for allo € Gal(L/K). Now, 
using ker 9 = fn, we get 0 0 O(a) = Yo o(a) = (a) for o € Gal(L/K), which 
implies e(a) € p(Ar)N Ax =C. But then we can look at the homomorphism 


f: Go — wn, or>a-—o(a), 


which satisfies g = fog =q"(f), and it follows that q* is surjective. 


As a first example in which Theorem 1 is applicable, we studied in Sec- 
tion 4.9 Kummer theory for an exponent n not divisible by the characteristic 
of the field kK under consideration. From now on we assume p = char K > 0 in 
order to develop Kummer theory for exponents of type n = p’. The case n = p 
(Artin-Schreier theory) is quite simple. Here one considers the additive group 
A = K, with the canonical action of G as a G-module, together with 
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go: A—> A, at— a? —a, 


as a G-homomorphism. Then 4, = ker g equals the prime field in Ax = K and 
thus is a cyclic subgroup in Ax of order p, as required. To make Theorem 1 
applicable, it remains only to establish Hilbert’s Theorem 90. We will do this 
on a more general scale in Proposition 11 below. 


Witt vectors. — Kummer theory in characteristic p > 0 for general expo- 
nents n = p", r > 1, is quite involved and relies on the formalism of Witt vectors, 
introduced by E. Witt, a theory that we want to present now. Given a prime 
number p, the Witt vectors with coefficients in a ring R form a ring W(R), the 
Witt ring on R. Characterizing W(R) as a set, it is given by W(R) = RN, the 
countably infinite Cartesian product of R with itself. However, the sum and 
the product of two elements x,y € W(R) are defined in a nonstandard way by 
expressions of type 


T+Yy= (Silat: oo cy (Palas). 3 


where S,,(,y), P.(z,y) for n € N are polynomials in x,...,%, and Yyo,.--,Yn 
with coefficients in Z, hence polynomials in the first n + 1 components of x, 
resp. y.° If p = p+ 1 is invertible in R, we will see that W(R), as a ring, is 
isomorphic to RN with componentwise addition and multiplication. 

To set up the polynomials S,,P, € Z[Xo,-.-;Xn; Yo,---; Yn] for n € N, 
consider the Witt polynomials 


Wr = So pix?" = Xp" + px?" +... + p"Xn € Z[Xo,..., Xn]. 
i=0 


They satisfy the recurrence formulas 
(«) W,, = Wr-1(X$,...,X?_1) +p" Xn, n> 0, 


and it is seen by induction that in each case, X,, can be written as a polynomial 
in Wo,...,W,, with coefficients in Zz, say 


Xo = Wo, X,=p "Wi -p "We, 
Lemma 2. The substitution endomorphism 


A Mis Se STs als 
f(Xo,---; Xn) > f(Wo,---,Wr), 


mapping the variables Xo,...,Xn to the polynomials Wo,...,W,, is bijective. 
In particular, the maps wy, n € N, give rise to an automorphism 


8 The multiplication of elements in R by integers in Z is defined in the usual way, for 
example, with the aid of the canonical homomorphism Z —> R. 
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Ww: Zl5][Xo, X1,---] —_ Zl 5 (Xo; Xiz<-+]; 
ff (Xo, X41, ae .) -—> f(Wo, W,, an .). 


Proof. Indeed, wy, is surjective, since each of the variables Xo,...,Xn can be 
written as a polynomial in Wo,...,W,. But then w,, is injective as well for 
general reasons; for example, extend coefficients from ZI] to Q and apply 
7.1/9. 

Let us add here an alternative argument showing that w,, is injective. We 
proceed by induction on n. The case n = 0 is trivial, since Wy = Xo. Therefore, 
assume n > 0 and let 


f=>oK-X, = fe ZG) Xo... Xn, 
i=0 


be a nontrivial polynomial in Xo,...,X, with coefficients in VATA such that 
f, £0. Then we obtain 


wWn(f) _ » fi(Wo, rn) W,,-1) . Wi, 
i=0 


where each f;(Wo,...,Wn—1) is a polynomial in Xo,...,Xp,—1 and where fur- 
thermore, f,(Wo,...,W,-1) is nonzero by the induction hypothesis. Now write 
Wn(f) as a polynomial in X,, with coefficients in Z[4][Xo,...,Xn—1]. Since 
p’X,, is the only term in W,, containing the variable X,,, the leading term of 
wn(f) turns out to be p” f,(Wo,...,Wn-1) + X7, and we get w,(f) 4 0. There- 


fore, Ww, is injective. 


In most cases, we will view the polynomials W,, as elements of the poly- 
nomial ring Z| Xo, X1,...], although they actually are polynomials in finitely 
many variables. Proceeding like this, the values W,,() will be meaningful for 
points x € RN, for any ring R. 


Lemma 3. Assume that p is invertible in R. Then the map 


w: W(R) = RN — RX, a-—> (W,(2)) 


neN’ 

is bijective. 

Proof. The homomorphisms w,, and w of Lemma 2 are substitution homomor- 
phisms, and the same is true for their inverses w,' and w~!, due to the universal 


property of polynomial rings dealt with in 2.5/5, resp. 2.5/1. Therefore, there 
exist polynomials W,, € Zs] [Xo,-.-,Xn], n € N, such that 


Wr(Wo,.-.; Wn) = Xn, W,(Wo,---,Wa) = Xp 
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for all n. Since p is invertible in R, the canonical homomorphism Z —> R 

extends (uniquely) to a homomorphism Zs] —> R. Furthermore, the pre- 

ceding relations remain valid if we replace Zs] by R as coefficient ring. As a 

consequence, the map w admits an inverse and therefore is bijective. 
Alternatively, we can view the map 


rs Hom(Z(2] bac AR), ar (f-— f(a), 
as an identification and then interpret w: RN —> RN as the map 
Hom (Z(2] [Xo, Xi,-.-], 8) =% Hom(Z(2] [Xo, Xi,-.-],), 
pr > pow, 


which is induced by the isomorphism w of Lemma 2; here Hom(C, R), for rings 
C and R, means the set of all ring homomorphisms C' —> R. 


Now consider a ring R such that p is invertible in R. Then, departing from 
R® as a ring with componentwise addition “+,” and componentwise multiplica- 
99 


tion “-.”, we can introduce laws of composition “+” and “-” on W(R) by means 
of the formulas 


aty=w '(w(z) +.w(y)), t-y=w (w(x) --w(y)). 


It is immediately clear that W(R) is a ring under these laws. Indeed, ad- 
dition and multiplication on W(R) are defined in such a way that the map 
w: W(R) — RN becomes an isomorphism of rings. 

It can easily be checked that the nth components of a sum x + y or of a 
product x- y of elements x,y € W(R) depend in a polynomial way on the ith 
components of « and y, where 7 < n. More precisely, w is given in terms of 
polynomial expressions with coefficients in Z, and similarly, w—! is given by ex- 
pressions of the same type with coefficients in Zs], so that all in all, coefficients 
in Z[5] are needed. However, we will see at once that coefficients from Z are suf- 
ficient to characterize the laws of composition “+” and “.” on W(R). This will 
enable us to define the Witt ring W(R) also for those rings R in which p is not 
invertible. To explain this we need an auxiliary assertion on Witt polynomials. 


Lemma 4. Let R be a ring such that p= p-1 is not a zero divisor in R. The 
following conditions are equivalent for elements ao,...,@n,bo,-.-,0n € R and 
r € N— {0}: 

(i) a; =b; mod (p") fori =0,...,n. 

(ii) Wi(ao,..-, ai) = Wi(bo,...,b;) mod (p"t*) fori =0,...,n. 


Proof. We proceed by induction on n, the case n = 0 being trivial. Therefore, 
assume n > 0. Conditions (i) and (ii) are equivalent for n—1 in place of n by the 
induction hypothesis. Therefore, if one of the conditions (i) and (ii) holds, we 
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may in either case assume that both conditions are satisfied for 7 = 0,...,n—1. 
Taking the pth power of the congruences in (i) yields 
ae? = mod (pt), i=0,...,n-1, 


7 


since r > 1 and p divides the binomial coefficients ay eee (Gs als In particular, 
using the induction hypothesis we get 


Wyai(06; 2-904) = Wales 8-4) mod (a), 
and by the recurrence formulas (*), 
W,(ao,---,Qn) — Wr(bo,.--;bn) = p"Gn — p"by, mod (p"*”). 
Therefore, the congruence 
W,,(do,---;@n) = Wr(bo,---,0n) mod (p"*”) 


is equivalent to p"a, = p"b, mod (p’*”), hence to a, = b, mod (p"), since p 
is not a zero divisor in R. 


Lemma 5. Let & € Z[C¢,€] be a polynomial in two variables ¢ and €. Then 
there exist unique polynomials yp, € Z[Xo,..-,XnYo:---;¥n],n € N, such that 


WAC eae Yn) = ®(W,,(Xo, Sipe Kali W,.(Yo, aa a) 


for all n. 


Proof. Set = (Xo, X1,...), as well as Y = (Yo, Vi,...), and consider the 


commutative diagram 
Z[s1[X] —> Z[pI[%] 
| 


| 
"| ih 
Z(A1(%, 9] > Z[4][%,9] 
that is determined by 


Ww? Xn > W,, 
W@W: Xp, W,(Xo,---,; Xn), Yn > Wr(Yo,---, Yn), 
Tv: Xp (Xn, Yn), 
1 


one =(w@w)oTow™. 


For this, observe that w is an isomorphism by Lemma 2 and that the same is true 
for w @w). In particular, 7’ is well defined as the unique homomorphism making 
the diagram commutative, i.e., such that the equation T'ow = (w@w)or 
holds. Now set gy, = 7T'(X,) for n € N. Thereby we get unique polynomials 
Yn € Z(5] [Xo0,-.--,Xn, Yo,---, Yn] satisfying 
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WalYo,+++, Pn) = O( Wal Xo, «++; Xn); Wa(Yor++= Yn), neN. 


In particular, in order to verify the assertion of the lemma, it remains to show 
that all polynomials y,, have coefficients in Z. 

To justify this claim we use induction on n. If n = 0, we have Wo = Xo 
and therefore yp = ®, so that yo admits coefficients in Z. Now let n > 0. We 
may assume by the induction hypothesis that yo,...,Qn—1 have coefficients in 
Z. Furthermore, consider the element 


W,(Yo,---; On) = 7 ow(X,) = (w @w)or(X,), 


which, by the definition of w and 7, is a polynomial in ¥ and Q) with coefficients 
in Z. Using the induction hypothesis, the same is true for W,,_1(5,.--, 9R_1); 
and the recurrence formula (*) yields 


W,(Yo; cane | Pn) = W,-1(¢); Be ety) yp _1) + P"'QDn- 


Thus, to show that y, admits coefficients in Z, it is enough to show that 


Wn (G0, -++1 Pn) = Wn-1(¥o--++n—1) mod (p"). 


Every polynomial y € Z[X,)] satisfies y? = y(X?,Q”) mod (p), as is 
easily seen by applying the reduction homomorphism Z[X,2)] —> F,[X,Q] 
and using 3.1/3; note that ¥?, resp. 9)’, is meant as the system of all pth 
powers of the components of X, resp. 2). In particular, we have 


vt = pi(X?, DY?) mod (p), 1=0,...,n—1, 
which implies by means of Lemma 4 that 


W,-1(¥0) mR ig r-1) = W,r-1(¥o(X?, QP), ee) Paaile’, y”)) mod (p"). 


Now, combining the commutativity of the above diagram with the recurrence 
formula (*), we get the following congruence modulo (p”): 


Wr(Yo,---)Pn) = &(W,,(X), W,(Q)) 
= 8(W,,_1(¥”), Wn-1(Y”)) 
= Wr_1(vo(X?, YP), --- Pn—1(¥?, Y?)) 
= Wr-1(¥)- ++ Pn—1): 


This shows, as explained before, that y, admits coefficients in Z. 


Applying Lemma 5 to the polynomials 


P(¢, €) = ¢ a g, resp. D(C, €) = ¢ : g, 


we obtain corresponding polynomials 
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Sins Pn € Z[X0,---, Xn, Yo,---; Yn], neéeN, 
in place of the y,, where, for example, 
So = Xo + Yo, Sp =X, +N +5(X$ + V9 — (Xo+¥o)?), 


Po = Xo0° Yo, Py = XiYP + X8Y1 + pX1¥1. 


We will utilize the polynomials S,,, P,, in order to define the addition and mul- 
tiplication in Witt rings W(R) over arbitrary rings R. As already mentioned, 
W(R) as a set equals the Cartesian product RN. Therefore, we can introduce 
laws of composition on W(R) by setting 


at+y=(S,(z, 0) cag Ley= (Ee) cies x,y © W(R). 
Looking at Lemma 5, we see that the map 


w: W(R) > RS, z+ (W,(2)) 


neN ? 


satisfies the compatibility relations 
warty)=w(t)+ewly), we y)=u(t)-cwly), tye WR), 


and we can state that w is a ring homomorphism, provided we know that W(R) 
is a ring with respect to the laws of composition given by “+” and “-”. On the 
other hand, if p is invertible in R, we know from Lemma 3 that w is bijective. 
In such a case we get 


aty=w '(w(x) +. w(y)), z-y=w '(w(z)-.w(y)), x,y € W(R), 


and it follows, as already explained, that W(R) is indeed a ring with respect to 
the laws of composition “+” and “-”. In particular, w: W(R) —> RN is a ring 
isomorphism in such a case. 


Proposition 6. Consider an arbitrary ring R. Then the laws of composition 
“4% and “.” derived from the polynomials S,,,P, as given above make W(R) a 
ring with the following properties: 

(i) (0,0,...) € W(R) is the zero element, and (1,0,0,...) € W(R) the unit 
element. 

(ii) w: W(R) — RS, 2 +> (W,(2))ien, is a ring homomorphism, even a 
ring isomorphism if p is invertible in R. 

(iii) For every ring homomorphism f: R —> R’, the induced map 


W(f): W(R) — WR’), (Gn)nen > (f(4n)) news 


is a ring homomorphism as well. 


Proof. First, assume that p is invertible in R. Then, as we have seen, W(R) is 
a ring and w: W(R) —> RN is an isomorphism of rings. Since apparently 


4.10 General Kummer Theory and Witt Vectors* 217 


w(0,0,...)=(0,0,...),  w(1,0,0,...) =(1,1,1,..,), 


it follows that (0,0,...) is the zero element and (1,0,0,...) the unit element in 
W(R). 

Next we look at the particular case that R = Z[] (X,Y, 3] for countably 
infinite systems of variables 


=... DS. 8H 


Then X, 2), 3 may be viewed as elements of W(R), and the associativity condi- 
tions 


(¥4+9)+3=*X+(M+4+3), (D3 =2+Q) <3) 


represent certain polynomial identities among the S,,, resp. P,, where only coef- 
ficients from Z are involved. As a consequence, these identities must hold already 
in the polynomial ring Z[X, 2), 3]. Concerning further ring axioms, we can pro- 
ceed in a similar way. For example, we can apply Lemma 5 for &(¢,€) = —¢ in 
order to see that the addition in W(R) admits a process of forming inverses, 
defined in terms of coefficients in Z. As a consequence, it follows that the laws 
of composition “+” and “.” satisfy, so to speak in a generic way, the axioms of 
a ring structure on W(R), where they are represented by certain formal polyno- 
mial identities with coefficients in Z. If we substitute the variables by values of 
an arbitrary ring R, the laws of composition “+” and “.” retain the properties 
of a ring structure, and we can conclude that W(R) is ring, regardless of the 
type of underlying ring R. 

It still remains to verify assertion (iii). Using the universal property of poly- 
nomial rings as proven in 2.5/1, we can identify W(R) for arbitrary rings R 
with the set Hom(Z[X], R) of all rig homomorphisms Z[X] —> R and like- 
wise W(R) x W(R) with Hom(Z[X, Y], R). Then addition and multiplication 
on W(R) are to be interpreted as the maps 


Hom(Z[X,Q], R) —+ Hom(Z[X],R), proyog, 
that are induced by 
g: Z[X] — Z[X, 9], Xn +> Sy, resp. X, +> Py. 


Furthermore, given a ring homomorphism f: R —> R’, we obtain for the ad- 
dition, as well as for the multiplication, in each case a canonical commutative 
diagram 

Hom(Z[X, 9], R) _— Hom(Z[X], R) 


Hom(Z[X, 2], R’) —s Hom(Z[X] : i), 


where the vertical maps are given by composition with f, and in addition can 
be interpreted as the map 


W(f): W(R) + WR), (an)nen— (F(4n)) nen 
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resp. as the Cartesian product of this map with itself. The commutativity of 
the diagram then corresponds to the homomorphism property of W(f). 


The ring W(R) is called the Witt ring attached to the ring R, and its el- 
ements are referred to as Witt vectors with coefficients in R. For any element 
a € W(R), its image w(a) € RN is called the associated vector of ghost compo- 
nents of a. The reason is that addition and multiplication of these components 
in the usual way (at least in the case that p is invertible in R) determine the 
ring structure of W(R), although these components themselves are not visible 
in W(R). 

We want to add some simple rules for doing computations in W(R). Con- 
sidering again the homomorphism w: W(R) —> RN, we have 


w((a - B,0,0,.. .)) = w((a, 0,0, ie .)) . w((8, 0,0, ae A) 


for elements a, € R, since W,(7,0,0,...) = 7?" for y € R. From this we 
deduce the rule 


(a,0,0,...)« (8,0,0,...) = (a+ B,0,0,...) 


for the multiplication in W(R), in a first step for the case that p is invertible 
in R, and then, using an argument as in the proof of Proposition 6, also for 
arbitrary rings R. In the same way one verifies the decomposition rule 


(dao, @1,---) = (@o,---, Qn, 0,0,...) + (0,---,0, @n4i, Gnte,---) 


for the addition in W(R). In the sequel the multiplication in W(R) by p will be 
of special interest, particularly for the case in which we have p- 1 = 0 in R. For 
example, for R = F, and a € W(F,) we get w(p-a) = p- w(a) = 0, although 
the p-multiplication on W(F,) is nontrivial, as we will see below. In particular, 
this shows that the homomorphism w: W(F,) —> F} cannot be injective. In 
other words, elements in W(F,) are not uniquely characterized by their ghost 
components. 

In order to study the p-multiplication on W(R), we introduce the Frobenius 
operator 


F:W(R) > W(R), (Gp, @1,-..) +> (ab, a?,...), 
as well the Verschiebung operator ® 
V: W(R) — W(R), (dao, @1,---) + (0, a0, @1,.-.). 


Both operators commute with each other, i.e., VoF = FoV. Also note that the 
Frobenius operator F’': W(R) —> W(R) is a ring homomorphism over rings R 
satisfying p- 1 = 0. Indeed, R —> R, at—> a?, is then a ring homomorphism, 
and as we have explained, it induces a ring homomorphism W(R) — W(R), 


° Tt is common to use the German word Verschiebung for this operator; it means shifting. 
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which coincides with F’. The Verschiebung operator V does not admit such a 
property, but is always additive. To justify this, we may assume, similarly as in 
the proof of Proposition 6, that p is invertible in R. Then w: W(R) —> RN is 
an isomorphism, and we have W,,,;(V(a)) = pW,,(a), resp. 


w(V(a)) = (0, pWo(a), pWi(a),..-). 
This means that V is transported via w to the map 
SR, (Zo, %1,---) + (0, pxo, pv1,.--), 


which clearly is componentwise additive. 

Now the p-multiplication on W(R), which in most cases is simply denoted 
by p, can be characterized in terms of the Frobenius and the Verschiebung 
operators as follows: 


Lemma 7. For a € W(R), let (p-a) be the p-fold sum of a in W(R) and 
(p-a)n its associated component of index n. Likewise, let ((V o F’)(a))n be the 
component of (Vo F)(a) of index n. Then 


((VoF)(a)), = (p-a),, mod (p), neNn. 
In particular, if p-1=0 in R, we get the relation 


VoF=FoV=p. 


Proof. Using a similar argument to that applied in the proof of Proposition 6, 
we may assume that p is not a zero divisor in R. Then, by Lemma 4, the stated 
congruences are equivalent to 


W,((V o F)(a)) = W,(p-a) mod p"*?, neN. 
Furthermore, the recurrence formulas (*) imply 
W,((V o F)(a)) = W.(F(V(a)) =Wrii(V(a)) mod gr 
and we get 


W,41(V (a) =p: W,(a) = W,,(p- a). 


Indeed, the left equality of the preceding equation was used above to show that 
V is additive, while the right one follows from the fact that w: W(R) —> RN is 
a homomorphism. Putting everything together, the desired congruences follow. 


In view of Kummer theory in characteristic p > 0 and for an exponent p’, 
we need rings of Witt vectors of finite length r > 1. So far we have considered 
the set RN and have looked, so to speak, at Witt vectors of infinite length. 
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However, we can just as well restrict ourselves to vectors (do,...,@--1) € R" 
of finite length r. Since the polynomials S,,, P,, contain only variables X;, Y; for 
indices 7 < n, they give rise to laws of composition on each R”, and one shows as 
in the case of Witt vectors of infinite length that they define a ring structure on 
R’. The resulting ring is denoted by W,.(R) and is called the ring of Witt vectors 
of length r over R. It is easily checked that the assertions of Proposition 6 carry 
over mutatis mutandis, where W; (J) is canonically isomorphic to R. If V is the 
Verschiebung operator on W(R), as considered above, then the projection 


W(R) — W,(R), (do, @1,---) ++ (o,---, r—1), 
is a surjective ring homomorphism with kernel 
V™W(R) = { (a0, a1,.--) € W(R); ao =... = a1 = 0} 
and hence induces an isomorphism W(R)/V"W(R) > W,(R). In particular, 
V'W(R) is an ideal in W(R). Furthermore, V": W(R) —> W(R) is an in- 
jective homomorphism of additive groups mapping V*W(R) onto V"**W(R) 
for every k € N. Therefore, V" gives rise to an r-fold Verschiebung opera- 


tor Vi: W;,(R) —> W,+(R), which is an injective homomorphism of additive 
groups as well. Clearly, we get 


imV,/ = { (ao, oy Orte—1) © Wrye(R); a9 =... = Gp = O}, 
and this image coincides with the kernel of the projection 
W,+%(R) —? W,(R), (do, eae Or+k—-1) + (do, ae) Qp—1) 


It follows that V’ induces an isomorphism W,+,(R)/VJW;,(R) > W,(R) and 
hence gives rise to an exact sequence of abelian groups 


0 — W,(R) ~& W,4(R) — W,(R) — 0. 
Alternatively, we can consider on W,(R) the map 
W,.(R) “yy W,41(R) —< W,(R), (ao, oS Qp4) -—> (0, aQ,--- 1 Gro), 


as a Verschiebung operator. This operator, in the following denoted by V again, 
is additive, and its kth power V* for 0 < k < r admits 


v"-*W,(R) = { (ao, +; Ar-1) € W,(R); ag =... = Opp 0} 


as its kernel. 


Kummer theory of exponent p”. — Let us consider an exponent p’ for 
r > 1, as well as a field K of characteristic p > 0, together with a separable 
algebraic closure K’, of K and its absolute Galois group G = Gal(K,/). Then 
every Galois automorphism 0: kK, —> K, induces an automorphism of rings 
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W,(K,) —> W,(Ks), (ao,--+,@r—1) + (a(ao),.--,0(ar—1))- 


Thereby we obtain a homomorphism G —> Aut(W,(K,)), which we will view 
as an action of G on W,(K,). This action is continuous, since the action of 
G is continuous on the particular components of W,(.A’,). Thus, writing A for 
the additive group of W,(K,), we see that A is equipped with a continuous 
G-action, where, in the sense of general Kummer theory, we have Ay; = W,(L) 
for intermediate fields L of K,/K. 
Furthermore, 
eg: AA, at—> F(a) —a, 


is an endomorphism of A that is compatible with the G-action on A. 


Theorem 8. Assume chark = p > 0. Then A = W,(K,), viewed as a 
G-module, together with the G-homomorphism 


eg: AA, at—> F(a) —a, 


satisfies the conditions of Theorem 1 for Kummer theory of exponent p" over 
K. 


We divide the proof into several steps. 
Lemma 9. 9: W,(K,) —> W,(K,) is surjective. 


Proof. For r = 1, we get A = W,(K,) = Ks, and it has to be shown that the 
map 

go: K, — Kg, ar aPr—a, 
is surjective. However, this is clear, since polynomials of type X? — X —c, where 
c € K,, are separable. For general r, it is easily checked that ~ is compatible with 
the Verschiebung operator, as well as with the projection W,(K;) —>+ Wi(K,). 
Therefore, if r > 1, we get a commutative diagram 


1 


i= Wate) ee = ey 
0 ——> W,-1(K,) s+ W,(K,) ——> Wi(K,) ——> 0, 
( 


and we can conclude from the surjectivity of @ on W\(K,) and on W,_1(K;) 
that it is surjective on W,(/X,) as well. 


As a next ingredient for Kummer theory, we determine the kernel of g. To 
do this we view F, as the prime subfield of our field K. 


Lemma 10. The kernel of 9: W,(K,) —> W,(K;) satisfies ker 9 = W,(F,). 
This group is cyclic of order p” and generated by the unit element e € W,(F,). 
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Proof. The solutions of the equation «? = x in K, consist precisely of the 
elements of the prime subfield F, C A. Therefore, we get ker ¢ = W,(F,), due 
to the definition of g. Thus, ker 9 is a group of order p”, and we claim that the 
unit element e = (1,0,...,0) € W,(F,) is of this order. Indeed, the order of e 
divides p” and thus is a p-power. Using the formula V o F = p from Lemma 7, 
repeated multiplication by p moves the component 1 in e at each step by one 
position to the right, so that indeed, p” turns out to be the order of e. 


Finally, in order to make Theorem 1| applicable and thereby characterize all 
abelian extensions of an exponent dividing p’, it remains to establish Hilbert’s 
Theorem 90. 


Proposition 11. Let L/K be a finite Galois extension in characteristic p > 0 
with Galois group G. On the ring of Witt vectors W,(L) over L of given length 
r, consider the componentwise action of G. Then 


H"(G,W,(L)) =0, 


i.e., every 1-cocycle is already a 1-coboundary. 


Proof. We proceed similarly as in the proof of 4.8/2, but in addition, must make 
use of the trace map 


troy: W.(L) + W,(K), a+ S-o(a). 


o€G 


Since every o € G defines a W,(A’)-automorphism of W,(L), we see immediately 
that the trace map is W,(4)-linear. In addition, trz/% is compatible with the 
projection W,(L) —> W,(L) = L, where the trace map on W(L) coincides by 
4.7/4 with the usual trace map trz/q%: L —+ K. Proceeding by induction on r, 
we want to show that trz/«: W,(L) —> W,(Jc) is surjective. 

If r = 1, we have to deal with the usual trace map, as defined for finite field 
extensions. The assertion then follows from 4.7/7. Otherwise, we can use the 
fact that the trace map on W,(L) is compatible with the Verschiebung operator, 
and hence for r > 1, leads to a commutative diagram of the following type: 

Vay 


0 — > W,_1(L) —> W,(L) ——> W,(L) — 0 


try/K | trysK | tresK | 


yi 
0 —> W,_1(K) ——> W,(K) ——> W,(K) ——> 0 
As we know, the trace map is surjective on W,(L), and by the induction hy- 
pothesis, also on W,_1(L). Therefore, it will be surjective on W,(L) as well. In 


particular, there exists an element a € W,(L) such that trz/«(a) = 1. 
Now let f: G —> W,(L) be a 1-cocycle. Considering the Poincaré series 


b= 5° f(o')-o'(a), 


o'EG 
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we obtain for arbitrary o € G the equation 


o(b) = S) o(f(0')) (7 00')(a) 


a'EG 

= (Flo!) = flo) «(0 o0'V(a) 
a'EG 

= JT f(oo0")-(¢00\(a)— > fo): (co 0)(a) 
eG o'EG 


=b-— f(o) : trr/K (a) =b— f(o), 


i.e., f is a 1-coboundary. 


This concludes the proof of Theorem 8. 


Exercises 
1. Within the context of general Kummer theory for some exponent n, characterize 

all cyclic Galois extensions of a degree dividing n. 

2. Let K be a perfect field of characteristic p > 0. Prove the following properties of 

the Witt ring W(K): 

(i) The map 

K* — W(K)*, at (a,0,0,...), 
is a monomorphism of multiplicative groups. Is a similar assertion valid for 
the additive group K as well? 

(ii) The canonical map W(K) — lim W (ic) /p"W (kK) is an isomorphism of 
rings. In particular, W(F,) coincides with the ring Zp of integral p-adic 
numbers; see Section 4.2. 

(iii) W(K) is a principal ideal domain with maximal ideal p-W(K) = V'!W(K). 
Every other nontrivial ideal in W(K) is a power of this maximal ideal and 
hence is of type p":W(K)=V"W(K). 

3. Let p be a prime number and q = p’ a nontrivial power of p. Show: 
(i) Every a € W(F,) admits a representation 
iEN 
with unique coefficients c; € F, that are to be interpreted as Witt vectors 
(ci, 0,0, nee .) €E W(F,)- 

(ii) W(F,) = Z,[¢] for a primitive (¢ — 1)th root of unity ¢. 

Furthermore, determine the degree of the field of fractions Q(W(F,)) over Q(Z,). 

4. Let G be the absolute Galois group of a field kK. Using the notions of general 

Kummer theory, consider for a G-module A the maps 

B: Ar G(A/A), Ww: H+ AP, 
for subgroups A C A and H C G. Show that 
PoWo P(A) = P(A), Vo@oW(H) =W(H). 
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4.11 Galois Descent* 


Let K’/K be a field extension. Given a K-vector space V, say with basis (v;)ier, 
we can extend coefficients and construct from V a K’-vector space V' = V@x kK’, 
for example by viewing (v;)icr as a K’-basis and admitting coefficients from A’. 
Then V is called a K-form of V’. In a similar way, it is possible to derive 
from a K-homomorphism y: V —> W a K’-homomorphism vy’: V’ — W’ 
by means of extending coefficients. The subject of descent theory for K’/Kk 
is the reverse problem. Its aim is to describe K-vector spaces and their ho- 
momorphisms in terms of the corresponding extended objects over K’ together 
with so-called descent data on them. It is quite easy to specify K-forms V, W of 
given K’-vector spaces V’, W’. However, for a K’-homomorphism y’: V’ —> W’ 
and fixed K-forms V, W, it is not true in general that y’ is obtained from a 
K-homomorphism y: V —> W by extending coefficients. For this to work well 
it is necessary that y’ respect the descent data given on V’ and W’. 


In the present section we will study descent theory only for the case that 
is the fixed field of a finite group of automorphisms of K'/K, i.e., for finite Galois 
extensions A’’/K; cf. 4.1/4. Then the necessary descent data can be described 
in terms of group actions. However, let us point out that in algebraic geometry, 
descent theory is developed in much more generality. For example, consult the 
foundational work of Grothendieck [6], or see [3], Chap. 4. 

Before we actually start studying descent, let us put the process of coeffi- 
cient extension onto a solid basis, by introducing tensor products. We restrict 
ourselves to the special case of vector spaces. More general tensor products will 
be dealt with in Section 7.2. 


Definition 1. Let K'/K be a field extension and V a K-vector space. A tensor 
product of K' with V over K is a K'-vector space V', together with a K-linear 
map T: V —>+ V', admitting the following universal property: 

Given a K-linear map y: V —> W' into a K'-vector space W’, there exists 
a unique K'-homomorphism y': V' —+ W’' such that p = yor, in other words, 
such that y' is a K'-linear “continuation” of . 


Due to the defining universal property, tensor products are unique, up to 
canonical isomorphism. In the situation of the preceding definition, one writes 
K' @x V or V @x K’ for V’, depending on whether one likes to view V’ as a 
left or a right vector space under the scalar multiplication by elements from K’. 
Furthermore, for (a,v) € K' x V, the product a-7(v) is usually denoted by 
a@® v; this element is called a tensor. The elements of K'’ ®x V are finite sums 
of such tensors, as we will see further below. Of course, the corresponding fact 
remains true if V’ is viewed as a right vector space. 


Remark 2. The tensor product V' = K' ®@x V, as specified in Definition 1, 
always exists. 
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Proof. Fix a K-basis (v;)icr of V and consider the K’-vector space V’ = Kk!) 
with its canonical basis (e;);¢7. Mapping the basis vector v; € V to the basis 
vector e; € K’) for each i € J, and using K-linear extension, we obtain an 
injective K-linear map tT: V —> V’. Now let y: V — W' bea K-linear map to 
an arbitrary K'-vector space W’. If there exists a K’-linear map y’: V’ —> W' 
satisfying yp = y’ oT, then we get necessarily y/(e;) = y'(T(v;)) = v(v;). In 
particular, y’ is uniquely determined by y on the K’-basis (e;) of V’, and thereby 
as a K'-linear map, on all of V’. On the other hand, we can define a K’-linear 
map y’: V’ —> W’ satisfying y = y’ or by mapping e; +> y(v;) and using 
K'-linear extension. It follows that V’ together with the map 7 admits the 
properties of a tensor product of A’ with V over K. 


The proof shows that indeed, V’ = K' @x V arises from V by “extending 
coefficients on V.” Using the injective K-linear map 7: V —> V’ = K’® x V as 
an identification, we may view V as a K-linear subspace of K’@, V. In the proof 
we have fixed a K-basis (v;)icer of V and defined the tensor product K’ @ x V in 
such a way that it admits the same system (v;);er as a K'-basis. Also note that 
the resulting tensor product K’@,V is independent of the chosen K-basis (v;)je7 
of V, due to the universal property of the tensor product. Furthermore, one can 
easily show that every K-homomorphism y: V —> W between K-vector spaces 
V and W extends to a K’-homomorphism (K’ ® y): K'@x V —> K' @xK W 
between the corresponding K’-vector spaces. However, this fact can just as well 
be seen using the universal property of tensor products, since 


V — K'@x W, v->+1@(v), 


is a K-linear map and hence gives rise to a well-defined K’-homomorphism 
(K'@y): K'@xV — K'®@xW. 

We are now able to make the terms “A-form” and “defined over K,” as 
used above, more precise. To this end, consider any field extension K’/K. A 
K-linear subspace V of a K’-vector space V’ is called a K-form of V’ if the 
K'-linear map K' @x V —+ V’ induced by V <@ V’ is an isomorphism. Fixing 
a K-form V of V’, the preceding isomorphism is usually viewed as an identi- 
fication. Furthermore, a K’-linear subspace U’ C V’ is said to be defined over 
K if U' is the K'-extension of a K-linear subspace U C V, or in other words, 
if there exists a K-linear subspace U — V such that the induced A’-linear 
map K' @x U —>+ K' @x V = V’ (it is always injective!) identifies K’ @% U 
with U’. In particular, U is then a K-form of U’. Finally, a K’-homomorphism 
vy’: V' —>+ W’ between K’'-vector spaces with K-forms V and W is said to be 
defined over K if y' is the K’-extension of a K-homomorphism vy: V — W, 
i.e., if there exists a K-homomorphism y: V — W such that y’, using the 
identifications V’ = kK’ @x V and W’ = K’' ®x W, coincides with K’ @ y. 


10 Tn dealing with general tensor products, it is common practice to use the notation idg @y 
instead of K’@vy. Actually, we are dealing here with the tensor product of two K-linear maps, 
namely the identity map on K’ and the map yg; see also Section 7.2. 
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Next we want to approach the case of finite Galois extensions K’/K, assum- 
ing however, for the moment only that K is the fixed field of a (not necessarily 
finite) subgroup G C Aut(Kk’); see 4.1/4. Consider a K'-vector space V’ to- 
gether with a K-form V, where we identify V’ with A’ @x% V. Then we can 
define for every 0 € Ga K-linear map f,: K’ @x V —> K' ®x V, namely the 
one characterized by a ® v +> o(a) ® v. Indeed, fixing a K-basis (v;)ier of V 
and viewing it as a K’-basis of V’ as well, f, can be defined by 


fo. Vi SV’, > ros » o(aj)vi- 
The map f, is called o-linear, since it satisfies the relations 
fov'tw')=folv’)+ fw’), — fola'v') = o(@') fo(v'), 


for v',w’ € V' and a’ € K’. Furthermore, we have f, o f, = f.; for 0,7 € G, as 
well as f. = idy’ for the unit element ¢ € G. This means that the maps f, give 
rise to an action 


GxV'—SV', (o,v) —+ fo(v), 


of G on V’ in the sense of 5.1/1. This action, characterized by f = (fs)cea, is 
referred to as the canonical G-action attached to the K-form V of V’. 


Proposition 3. Let K'/K be a field extension such that K is the fixed field of 
a subgroup G C Aut(K"). Furthermore, consider a K'-vector space V', together 
with a K-form V and its corresponding canonical G-action f. 

(i) An element v € V' belongs to V if and only if f,(v) =v for allo €G. 

(ii) A K'-linear subspace U' C V’ is defined over K if and only if we have 
fo(U’) CU' for allo EG. 

(iii) A K’-homomorphism vy’: V' —+» W’' between K'-vector spaces with 
K-forms V, W and corresponding G-actions f, g, is defined over K if and 
only if y! is compatible with all a € G, i.e., if and only if '(f.(v)) = go(y'(v)) 
for alla €G andallv eV’. 


Proof. Assertion (i) is easy to obtain. Fix a K-basis (v;);cr of V and write 
v = )0,a,v; with coefficients a; € K'. Since f,(v) = >0,0(a;:)vi, we see that 
v is invariant under all f, if and only if the coefficients a; are invariant under 
allo € G, ie., if and only if all a; belong to AK and hence v is an element 
of V. Just as easily we can derive assertion (iii). Certainly, the compatibility 
condition given in (iii) is necessary. On the other hand, the condition implies 
yg (V) Cc W if we use (i). 

Turning to assertion (ii), the condition f,(U’) C U’ for all o € G is clearly 
necessary. To see that it is also sufficient, consider a K-basis (v;)icer of V, as well 
as the residue classes 0; € W’ = V’/U' of the elements v;. There is a subsystem 
(U;)icr of the system of all 0; forming a K’-basis of W’. Therefore, we can view 
W = Vien KU; as a K-form of W’ and consider on W’ the G-action g that is 
attached to W. We claim that the canonical projection y’: V’ —> W’ is defined 
over KX. To justify this, observe that every v € V’ can be written as 
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v=ut ) AjUi, 


ier! 
where u € U' anda; € K’ for alli € I’. Now use the fact that by our assumption, 
f.(U') C U' = kery’ for all o € G, and that f,(v;) = v; for all i € I’. This 
shows that y'(fs(v)) = go(y’(v)) for all a € G and hence that y’ is defined 
over FX, due to (iii). Then it is not difficult to see that together with y’, also 
U' = ker y’ is defined over K. 


Now we want to show that K-forms of vector spaces can be characterized 
in terms of group actions. 


Proposition 4. Let K'/K be a field extension such that K is the fixed field of 
a subgroup G C Aut(K’). Furthermore, consider a K'-vector space V'. For each 
a €G, let fz: V’ —> V' be a o-linear map satisfying f,° f; = for for 0,7 € G, 
as well as fz = idy for the unit element ¢ € G. Thus, the maps f, set up an 
action f = (fz) of Gon V’. Let V CV’ be the corresponding fixed set. 

(i) V is a K-linear subspace of V', and A: V @ V" induces a K'-linear 
map ': K' @~ V —> V', which is injective. 

(ii) If G is finite and hence K'/K a finite Galois extension, then X' is sur- 
jective and therefore bijective. In particular, V is a K-form of V’. 


Proof. Of course, V is a K-form of K' @x V for trivial reasons. Let h be the 
canonical action of G on K’@x V, where hs: K'@xK V —> K' @ x V is charac- 
terized by a®vu+-+ a(a) ®@ v. Then X’ is compatible with the actions h and f, 
since we have 


N (he(a® v)) = A’ (a(a) @v) = o(a)u = fo(av) = fo(N(a®v)). 


This implies h,(ker \’) C ker A’, and we conclude from Proposition 3 (ii) that 
ker \’ is defined over kK. Hence, there is a K-linear subspace U C V whose 
K'-extension in Kk’ @x V coincides with ker X’. However, for u € U we have 
u = X(u) = A’(u) = O and hence u = 0, so that » is injective. This settles 
assertion (i). 

To verify (ii), assume that Gis finite. It is enough to show that every linear 
functional y’: V’ —> K’ vanishing on V is identically zero on V’. Therefore, 
consider such a linear functional vy’, where y’(V) = 0, and let v € V’. Then, for 
variable a € K’, the elements Ug = )>,¢¢ fo(av) are invariant under the action of 
G on V’ and thus belong to V. Since y'(V) = 0, we get \o<g a(a)y’(fa(v)) = 0 
for all a € Kk’. Now view the preceding sum as a linear combination of the 
characters o € G and apply the linear independence result 4.6/2. It shows that 
all coefficients y'(f.(v)) € K’ vanish. In particular, for o = ¢, the unit element 
of G, we obtain y’(v) = 0. Therefore, every linear functional on V’ that is trivial 
on V is identically zero on V’. 


As a summary, we can learn from Propositions 3 and 4 for a finite Galois 
extension K'/K with Galois group G that the theory of K-vector spaces is 
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equivalent to the theory of K’-vector spaces with G-actions, as studied in this 
section. Within this setup, K-homomorphisms of K-vector spaces correspond 
to those K’-homomorphisms between attached A’-objects that are compatible 
with the G-actions under consideration. In particular, the G-actions play the 
role of the descent data, which were mentioned earlier. 

Finally, let us point out that the linear independence of characters 4.6/2 was 
used in the proof of Proposition 4 (ii) in a similar way to what we did in the 
proof of the cohomological version of Hilbert’s Theorem 90; see 4.8/2. Moreover, 
4.8/2 implies for a finite Galois extension K’/K the assertion of Proposition 4 
in the case dimg, V’ = 1, resp. V’ = K’. Just check for fixed v € K”™ that the 
map 


G— K™, or> fo(v) 
v 


d 


is a l-cocycle, and thus a 1-coboundary by 4.8/2. Therefore, there exists an 
element a € K™ such that f,(v)- v7! =a-o(a)"1, and we get 


f.(av) = o(a)- f.(v) = ar, 
i.e., av € V’ is fixed by all f,. Then it is easily seen that V = K - av equals the 
fixed set of the action of G on V’ and hence that it is a A-form of V’. 


Exercises 


1. Let K'/K be a field extension and A a K-algebra, i.e., a ring together with a ring 
homomorphism K —> A. Show that A@x K"' is naturally a K'-algebra. 


2. Give an alternative proof of Proposition 4 as follows. Use an inductive argu- 
ment to verify assertion (i) in a direct way. To establish (ii), choose a K-basis 


Q4,.--;Qn of K' and show that every v € V' admits a representation of type 
n 
v= a(S folaiv)) 
i=l ocEG 


with coefficients c; € K'. 


3. Let K'/K be a field extension and assume that K is the fixed field of a subgroup 
G c Aut(K’). Furthermore, consider a K’-vector space V’. For every o € G 
introduce a K’-vector space V/ as follows. Take V’ as its additive group, and 
define the scalar multiplication by a-v := o(a)v for a € K' and v € V', where 
the product on the right is meant in the sense of the K’-vector space V’. Then 
view the diagonal embedding \: V’ —> [eq Vz as a K-linear map, and look at 
the induced K’-linear map A: V' @x K' — [J eq Vj. Show that A is injective, 
and even bijective if [K’: K] < oo. Hint: Introduce a suitable action of G on 
Il-eg Vo such that V’ is the corresponding fixed set. 


4. Let K'/K be a field extension and V a K-vector space. Consider the K-linear 
maps 
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VV Gx K', veESve@l, 
Vek K' SV GK K' Ox K’, v@ar>vea@l, 
V @k K'’ SV @xK K' @x K', v@ar>velea, 


and show that the diagram 
V>V@x K'SV @x K' @x Kk’ 
is exact in the sense that the map on the left is injective and its image equals the 


kernel of the difference of the two maps on the right. 


. Let K'/K be a finite Galois extension with Galois group G. In the setting of 
Exercise 4 write V’ = V @x K’' and identify V’ @x K’ with [],¢q Vj in the sense 
of Exercise 3. Then describe the two maps V’ = |],-¢ Vj of Exercise 4 in as 
simple a way as possible. 


5. More Group Theory ®) 


Check for 
updates 


Background and Overview 


Returning for a moment to the problem of solving algebraic equations, let us 
look at a monic polynomial f € K[X] with coefficients in a field AK’. Further- 
more, let L be a splitting field of f, where we assume that L/K is separable. 
Solving the algebraic equation f(7) = 0 by radicals amounts to constructing a 
chain of fields of type 


(*) Rake Rew k, 


such that LZ C K,, and in each case, Kj, is obtained from K; by adjoining a 
root of some element in K;. Indeed, it is precisely if such a chain exists that 
the solutions of the equation f(x) = 0 that generate the extension L/K can 
be characterized in terms of rational operations on elements of K, combined 
with the process of extracting roots. To simplify, let us assume in the following 
that the extension K,./K is Galois. Then the fundamental theorem of Galois 
theory 4.1/6 is applicable, and every chain of fields of type (*) is equivalent to 
a chain of subgroups 


(#*) Gal(K,/K) = Go 2G 2... 2G, = {1}. 


Furthermore, in 4.5 and 4.8 we characterized extensions that arise through the 
adjunction of nth roots. If we restrict ourselves to fields of characteristic 0 and 
assume that AK contains sufficiently many roots of unity, we can conclude from 
4.8/3 and 4.1/6 that a chain of fields as in («) is given by successively adjoining 
nth roots of elements for variable n if and only if the corresponding chain (**) 
admits the property that in each case, Gj, is a normal subgroup of G; and the 
residue class group G;/Gj41 is cyclic. More precisely, we will see in 6.1 that the 
equation f(x) = 0 is solvable by radicals if and only if there exists a chain (**) 
admitting these properties for the Galois group Gal(L/K). 

The above considerations in terms of Galois theory show that the problem 
of solving algebraic equations by radicals can be reduced to a group-theoretic 
problem. For example, from the fundamental theorem for finitely generated 
abelian groups 2.9/9 we conclude that algebraic equations with abelian Galois 
group are always solvable. However, to arrive at more specific results on the 
solvability of algebraic equations it is necessary to further complete the theory 
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of finite (not necessarily commutative) groups. In particular, we want to char- 
acterize all groups G admitting a chain of subgroups (**) such that as before, 
Gi41 is a normal subgroup in G; and the residue class group G;/Gi41 is cyclic. 
Such a group G is called solvable, where instead of “cyclic” we can just as well 
require the quotients G;/Gj41 to be abelian; see 5.4/3 and 5.4/7. 

In order to approach the subject of solvable groups, we start in 5.1 with some 
basic material on group actions. Interpreting the Galois group of an algebraic 
equation f(z) = 0 as a group of permutations on the corresponding set of 
solutions, cf. 4.3/1, we get a prototype of such an action. Using the concept of 
group actions, we prove in 5.2 the so-called Sylow theorems on finite groups, 
named after the mathematician L. Sylow. They provide information on the 
existence of subgroups whose order is a prime power. In special situations, Sylow 
theorems can be used to check whether a given group is solvable. Furthermore, 
we have assembled in 5.3 some basic facts on permutation groups, while finally, 
in 5.4 we study solvable groups. In particular, we prove that the symmetric 
group G,, is not solvable for n > 5, which will imply in 6.1 that the generic 
equation of degree n is not solvable by radicals for n > 5. 


5.1 Group Actions 


In the chapter on Galois theory we have already worked with group actions. 
However, this concept was not introduced explicitly, since we had to consider 
only the canonical action of a Galois group on its field or on the zero set of a 
polynomial. In the following we want to free ourselves from such a restricted 
setting and establish some combinatorial properties for general group actions. 


Definition 1. Let G be a (multiplicatively written) group and X a set. An 
action or an operation of G on X is a map 


Gx X +X, (g,t) > 9-2, 


such, that: 
(i) 1-2 =< for the unit element 1 © G and for elements x € X. 
(ii) (gh)-a2=g-(h-a) forgheG, «Ex. 


To begin with, let us list some examples of group actions. 


(1) For a group G and a set X, there is always the trivial action of G on X. 
It is given by the map 


Gx X — X, (g,x) +> @. 


(2) Let X be a set and write S(X) for the group of bijective self-maps 
X —> X. Then every subgroup G C S(X) acts on X via the map 


Gx X — X, (0, 2%) + o(2). 
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In particular, we can consider for a Galois extension L/K the action of the 
Galois group Gal(L/K) = Aut, (ZL) on L. This action was studied thoroughly 
in Chapter 4, on Galois theory. 


(3) For every group G the group multiplication 
GxG—G, (g,h) -> gh, 


may be viewed as an action of G on itself. In fact, G acts on itself via left 
translation, where, as mentioned earlier, the left translation by g € G is given 
by the map 

Tg. G—>G, hr gh. 


Similarly, we can use the right translation to define an action of G on itself, 
namely via 


GxG—G, (g,h) 3 hg. 
Looking at g instead of g~', the map 


76>, he-+> hg, 


is called the right translation by g on G. 


(4) Another action of G on itself is given by the conjugation action 
GxG—G, (g,h) +--+ ghg™". 


The map 
inty = Ty © ee :G—G, hr > ghg", 


is a group automorphism of G, the conjugation with g. Automorphisms of type 
int, are called inner automorphisms of G (“int” stands for “interior”), and 
it is easily checked that the canonical map G —+> Aut(G), g +--+ int,, is 
a group homomorphism. Two elements h,h’ € G are said to be conjugate if 
there exists an element g € G such that h’ = int,(h). In the same way, two 
subgroups H, H’ C G are called conjugate if there is an element g € G such 
that H' = int,(H). The relation of being conjugate is an equivalence relation 
for elements or subgroups in G.. Of course, the conjugation action is trivial if G 
is commutative. 


Similarly as in (3) we can define for a group action 
Gx X +X 
its (left) translation by an element g € G by 
Tg: X —> X, ELH ga. 


The family of translations (T,) eq fully characterizes a given action of G on X. In 
addition, G —> S(X), g++ 7,, is a group homomorphism, as is easily checked. 
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On the other hand, proceeding as in example (2), every group homomorphism 
yp: G —> S(X) gives rise to an action of G on X, namely to 


Gx X +X, (9,2) ++ y(g)(z). 


Both mappings are inverse to each other and we see that the following is true: 


Remark 2. Let G be a group and X a set. Then, using the above mappings, the 
group actions Gx X —+ X correspond bijectively to the group homomorphisms 
G— S(X). 


If we consider for a group G the action G x G —> G via left translation, 
then the corresponding group homomorphism G —> S(G) is injective, since 
Tg = Tg’ is equivalent to g = g’. In particular, we can view G as a subgroup 


of S(G). 


Definition 3. Let Gx X —> X be an action of a group G on a set X. The 
following notions are commonly used for points x © X: 

(i) Ga := {gx; g © G} is called the orbit of x in G. 

(ii) G, := {g € G; gu = x} is called the stabilizer or isotropy subgroup of 
xin G. 


That G, is a subgroup of G is easily checked. Indeed, note that G, contains 
the unit element of G and that for g,h € G satisfying gx = x = hx we have 


(gh')a = (gh')(hx) = g(h-'(hx)) = g((h'h)z) = gr =x. 


Remark 4. Let G x X —> X be an action of a group G on a set X. If x,y 
are two points of one and the same G-orbit in X, then the stabilizer subgroups 
G,,Gy C G are conjugate. 


Proof. It is enough to consider the case y € Gx. Therefore, let h € G be an 
element such that y = ha. Then, for g € Gz, we get 


(hgh"")y = (hgh“")ha = A(gr) = ha =y 


and hence hgh~' € Gy, so that hG,h~' C Gy. Likewise, we can derive from 
« = hy the inclusion h-'G,h C G,, showing that in fact, G, = hG,h~'. 


Furthermore, we want to show that two orbits Ga, Gy C X coincide as soon 
as Gz Gy # Q. Indeed, if there is an element z € Grn Gy, say z = gx = hy 
for some elements g,h € G, we get x = g-!z = g thy and therefore Gx C Gy. 
Similarly we obtain Gx D> Gy and thus Gx = Gy. Thereby we obtain the 
following result: 


Remark 5. Let G x X —> X be an action of a group G on a set X. Then X 
is the disjoint union of its G-orbits. 
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Given a group action G x X —> X and a G-orbit B C X, every element 
x € B will be referred to as a representative of this orbit. Likewise, a system 
of representatives of a family (B;)ier of disjoint G-orbits is a family (2;);er of 
elements of X such that x; € B; for alli € J. The action Gx X —+ X is called 
transitive if there is only a single G-orbit. 

We want to characterize the orbits of a group action in more specific terms. 
As usual, ord M denotes the number of elements of a set WV, and (G : H) denotes 
the index of a subgroup H in a group G. 


Remark 6 (Orbit-stabilizer lemma). Let G x X —+ X be a group action. For 
a point x € X, the map G —> X, g > ga, induces a bijection G/G, “> Gx 
from the set of left cosets of G modulo the stabilizer subgroup Gz, onto the orbit 
of x under G. In particular, we get 


ord Ga = ord G/G, = (G:G,). 


Proof. Look at the surjective map 
yp: G—> Gr, gr gz, 


and observe for g,h € G the following equivalences: 


y(g) = v(h) gu = hx ho gr2=2< > hg €G, 
<> gG, =hG;. 


This shows that y induces, similarly as in the case of the fundamental theorem 
on homomorphisms 1.2/7, a bijection G/G, “> Gz. 


As a direct consequence, we can conclude the following from Remark 5 and 
Remark 6: 


Proposition 7 (Orbit equation). Let G x X —>+ X be an action of a group G 
on a finite set X, and let x1,...,2%» be a system of representatives of the orbits 
of X. Then 


n 


ord X = Ser = SG + G5). 


i=1 i=l 


We will apply the orbit equation especially for X = G and the conjuga- 
tion action G x G —+ G, in order to derive the so-called class equation; see 
Proposition 9 below. In the following let G be a group and S C G a subset. The 
centralizer of S in G is given by 


Z5 ={x EG; as = sx for alls € S}. 
Furthermore, the center of G is defined as the centralizer of G, i.e., as 


Z= Zo = {2 € G; 2s = sz for all s € G}. 
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Finally, the normalizer of S in G is given by 
Ns ={x EG; ao = Sz}. 


Remark 8. (i) Z is a normal subgroup in G. 

(ii) Zg and Ng are subgroups in G. 

(iii) If S is a subgroup in G, then Ng is the largest of all subgroups H CG 
such that S is a normal subgroup of H. 


All these assertions are easy to check. As an example, let us consider the 
case Zg in (ii). If S consists of a single element s, then Zs = Ng equals the 
stabilizer group of s with respect to the conjugation action on G. Thereby we 
see for general S that Zg = (),<g Z{s} is a subgroup of G. Also note that always 
Zs C Ng. 


Proposition 9 (Class equation). Let G be a finite group with center Z. Fur- 
thermore, consider the conjugation action on G and let 11,...,% be a system 
of representatives of the orbits contained in G—Z. Then 


ordG = ord Z + SG : Fei) 


i=1 


Proof. The orbit of an element « € Z consists only of the element x itself. 
On the other hand, we can identify the orbit of an element x € G—Z with 
G/Z,a}; cf. Remark 6. Therefore, the assertion follows from the orbit equation 
in Proposition 7. 


Finally, let us add two results on the center Z of a group G. Since Z equals 
the kernel of the homomorphism 


G — Aut(G), gt int,, 


we can conclude the following from the fundamental theorem on homomor- 
phisms in the version of 1.2/7: 


Remark 10. The group of inner automorphisms of G is isomorphic to G/Z. 
Remark 11. If G/Z is cyclic, then G is abelian. 


Proof. Fix an element a € G such that G/Z is generated by the residue class 
a of a. Furthermore, consider elements g,h € G with residue classes 7 = a, 
h =a". Then there are elements b,c € Z such that g = ab, h = a"c. Since 

mn bo. 


gh =a"™ba"c =a ig= roar bog’ ™ baa te, 


it follows that gh = hg. 
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Exercises 


1. 


Let G be a finite group and H CG a subgroup. Consider the action of H on G 
via left translation (resp. right translation) and interpret the corresponding orbit 
equation in terms of elementary group theory. 


. Let L/K be a finite Galois extension with Galois group G. Consider the natural 


action of G on L and interpret the stabilizer group Gq for a € L, as well as the 
orbit Ga, in terms of Galois theory. Furthermore, determine the orders of Ga 
and Ga. 


Let G be a group and X the set of all subgroups of G. Show: 


(i) Gx X —+ X, (g,H) + gHg"', defines an action of G on X. 
(ii) The orbit of an element H € X consists of H itself if and only if H is a 
normal subgroup of G. 
(iii) If the order of G is a power of a prime number p, then the number of 
subgroups in G differs from the number of normal subgroups in G by a 
multiple of p. 


Let G be a finite group, H a subgroup, and Ny its normalizer. Furthermore, 
write M := Ugeg gHg~' and show: 

(i) ord M < (G: Ny) - ord dH. 

(ii) H AG implies M AG. 
Let G be a group, H a subgroup, as well as Ny and Zy the corresponding 
normalizer and centralizer of H in G. Show that Zz is a normal subgroup of Ny 


and that the group Ny/Zy is isomorphic to a subgroup of the automorphism 
group Aut(H). 


Burnside’s lemma: Let G x X —>+ X be an action of a finite group G on a set 
X. Write X/G for the set of orbits, as well as X9 = {x € X ; gx = x} for the set 
of elements in X that are left fixed by an element g € G. Show that 


ie 
ord(X/G) = od G’ S> ord X9. 
gEG 


5.2 Sylow Groups 


The fundamental theorem of finitely generated abelian groups 2.9/9 gives precise 
information on the structure of such groups, in particular, on the structure of 
finite abelian groups. In the following we will study finite groups without the 
commutativity condition. Our main objective is to derive the theorems named 
after L. Sylow on the existence of certain subgroups, called p-Sylow subgroups 
(or Sylow p-subgroups). We start by introducing the notion of Sylow groups, 
and in particular of p-groups. 


Definition 1. Let G be a finite group and p a prime number. 


(i) G is called a p-group if the order of G is a power of p. 
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(ii) A subgroup H C G is called a p-Sylow subgroup if H is a p-group such 
that p does not divide the index (G : H), in other words, if there exist integers 
k,m €N satisfying ord H = p*, as well as ord G = p*m, where ptm (use 1.2/3). 


It follows from the theorem of Lagrange 1.2/3 that the order of every element 
of a p-group is a power of p. Similarly, the same result shows that a p-Sylow 
subgroup cannot be strictly contained in a p-subgroup of G and therefore is a 
maximal p-subgroup of G. The converse of this fact will follow later from the 
Sylow theorems; cf. Corollary 11. The trivial subgroup {1} C G is an example 
of a p-group, and for pf{ordG, even an example of a p-Sylow subgroup in G. 
Furthermore, we can read from the fundamental theorem 2.9/9 that a finite 
abelian group G contains a unique p-Sylow subgroup S, 4 {1} for each prime 
p dividing ordG and furthermore, that G is the direct sum of all these Sylow 
subgroups. Although this is not needed later on, let us illustrate in Remark 2 
below how to prove the existence of Sylow subgroups in this case by means 
of elementary arguments. On the other hand, Remark 2 can be viewed as a 
simple consequence of the Sylow theorems, once they have been established; cf. 
Exercise 1. 


Remark 2. Let G be a finite abelian group. For every prime number p, there 
is exactly one p-Sylow subgroup in G, namely 


S, ={a€G; a” =1 for some t € N}. 


Proof. First we have to show that 5S, is a subgroup of G. To do this consider 


elements a,b € S,, say where a?’ =1 and b” =1, and write t = max(t’,t”). 
Using the commutativity of G, we get 


(ab!) =a .bP =1 


and thus ab~' € 9,. Since we have 1 € S;, anyway, S,, is indeed a subgroup of 
G. By its definition, S, contains all elements in G' whose order is a power of p. 
In particular, it will contain all p-subgroups of G. Thus, if we can show that S, 
is a p-Sylow subgroup of G, it will be unique. 

To justify that S, is a p-Sylow group, we proceed by induction on n = ord G. 
For n = 1 nothing has to be shown. Therefore, assume n > 1 and choose an 
element x # 1 in G. Replacing x by a suitable power of itself, we may assume 
that q = ord is prime. Then look at the cyclic subgroup (x) C G generated by 
x, and consider the projection 7: G — G’ = G/(x), where ord G’ = ;ordG 
by the theorem of Lagrange 1.2/3. 

If S’ CG’ denotes the subgroup consisting of all elements in G" whose order 
is a power of p, we know from the induction hypothesis that S) is a p-Sylow 
subgroup in G’. Furthermore, 7(.S,) C 5S), and we claim that we have even 
™(Sp) = Si. To justify this, consider an element @ € S’, with 7-preimage a € G. 
If p' is the order of @, we get a” € (x) and hence a?4 = 1. For p = q we can 
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conclude that a € 5,. On the other hand, if p # g, we see that p and q are 
relatively prime. Hence, there is an equation rp’ + sq = 1 for some integers r, s. 
This implies 


t t 
cra) =qt=C? aaa? =o, 


In addition, we have a*4 € S,, since a’? = 1. Thus, in either case, 7 induces a 
surjective map 7,: S, —> S’, satisfying ker 7 = (x) MN Sp. 

Now let n = ordG = p*m, where p{m. If p = q, the order of G’ is given 
by ord G’ = 5ordG = p*~!m, and we read ord S), = p*~! from the induction 
hypothesis. Furthermore, we have (x) C S, and therefore ker 7, = (x), so that 77, 
induces an isomorphism S,,/(x) + S’,. This shows that ord S, = p-ord S’, = p* 
by 1.2/3, and it follows that S, is a p-Sylow subgroup in G. On the other hand, 
if p ~ q, we have ord G’ = p*. ™ and hence ord 57, = p*. Since (x) cannot contain 
an element whose order is a nontrivial p-power, we get ker 7, = (x) NS, = {1}, 
and it follows that 7, restricts to an isomorphism S, —> ve In particular, 
ord Sp = ord S), = p*, so that also in this case, S, is a p-Sylow subgroup in G. 


In the noncommutative case, the theory of p-groups, and in particular of 
p-Sylow groups, is more complicated. We start by considering p-groups. 


Proposition 3. Let G be a p-group of order p*, for a prime number p and an 
exponent k > 1. Then p divides the order of the center Z of G, so that Z # {1}. 


Proof. Look at the class equation 5.1/9 for the conjugation action of G on itself 


ordG = ord Z+ S°(G: Zn3), 

i=l 
where 21,...,, is a system of representatives of the G-orbits in G — Z. By 
1.2/3, the index (G : Z{z,}) is a p-power for each i, since ordG is a p-power. 
Furthermore, (G : Z{z,}) is even a nontrivial p-power, since Z;,,; is a proper 
subgroup of G, due to x; ¢ Z. Consequently, we get p|ord Z. 


Corollary 4. Let G be a p-group of order p*, for a prime number p. Then there 
is a descending chain of subgroups 


G=G,D Gy-1D...DGo= {1} 


such that ord G; = p* and Gy_, is a normal subgroup in Gy for £=1,...,k.4 
In particular, for every divisor p’ of p* there is a p-subgroup H C G such 
that ord H = p‘. If k > 1, it follows that G admits an element of order p. 


Proof. We conclude by induction on k, the case k = 0 being trivial. Therefore, 
assume k > 0. Applying Proposition 3, the center Z C G is nontrivial and there 


! The quotients G/G_1 are of order p and hence cyclic as well as abelian. Thereby it is 
seen that every finite p-group G is solvable in the sense of Definition 5.4/3. 
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is an element a 4 1 in Z. If p" is its order, we see that a?" is of order p. Hence, 
we may assume orda = p. Since a belongs to the center of G, the subgroup 
(a) C G generated by a is normal in G. Then G = G/(a) is of order p*~", using 
1.2/3, and we can apply the induction hypothesis to this group. Hence, there 
exists a chain of subgroups 


G=G, > Giid...DG = {i}, ord G; = pt," 


such that Gy_; is a normal subgroup in G, for £ = 2,...,k. Now consider the 
projection 7: G —+ G/(a) and set Gy = 77!(G) for €=1,...,k. Then clearly, 


G=G,D Ga-1D...D Gi Dd {1} 


is a chain of subgroups in G as desired. 


Proposition 5. For a prime number p, let G be a group of order p?. Then G 
is abelian. More precisely, we have 


GrZ/vPZ or GrZ/pL x Z/pZ. 


Proof. 'To start with we show that G is abelian. From Proposition 3 we conclude 
that p|ord Z for the center Z of G and hence that Z is of order p or p’. If 
ord Z = p*, then G = Z and G is abelian. On the other hand, if ord Z = p, it 
follows that G cannot be abelian. However, G/Z is of order p then, in fact cyclic 
of order p, and we could conclude from 5.1/11 that G is abelian, in contradiction 
to the fact that ord Z = p. 

Now use Corollary 4 and choose an element a € G such that orda = p. 
Furthermore, let b € G belong to the complement of the cyclic subgroup (a) C G 
generated by a. Then b is of order p or p”, where in the latter case G is generated 
by 6, implying G = (b) ~ Z/p?Z. Therefore, assume ordb = p. We claim that 
the map 

yp: (a) x (b) > G, (a’, b’) 3 ald’, 


is a group isomorphism. First, y is a group homomorphism, since we know 
already that G is abelian. Furthermore, we see that (a)M(b) is a proper subgroup 
of (b), since b ¢ (a). Hence, we must have (a) M (b) = {1}, and ¢ is injective. 
But then y is surjective as well, since 


ord((a) x (b)) =p” = ordG. 


Using (a) ~ Z/pZ ~ (b), we get G ~ Z/pZ x Z/pZ, as desired. Alternatively, we 
could have based our argument on the fundamental theorem of finitely generated 
abelian groups 2.9/9. 


After these preliminaries, let us derive the Sylow theorems that were men- 
tioned before; they correspond to the different items of the following result: 
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Theorem 6 (Sylow theorems). Let G be a finite group and p a prime number. 
(i) The group G contains at least one p-Sylow subgroup. More precisely, for 
every p-subgroup H C G there is a p-Sylow subgroup S CG such that H CS. 
(ii) If S CG is a p- Sylow subgroup, then every subgroup in G that is con- 
jugate to S is a p-Sylow subgroup of G as well. Conversely, any two p-Sylow 
subgroups in G are conjugate to each other. 
(iii) The number s of p-Sylow subgroups in G satisfies 


sjordG, s=1 mod (p). 


We divide the proof of the theorem into several parts and start with a 
fundamental lemma. Its proof will be given following an idea of H. Wielandt, 
similarly as in [10], Kap. I, Satz 7.2. 


Lemma 7. Let G be a finite group of order n = p*m, where p is prime, but not 
necessarily relatively prime to m. Then the number s of p-subgroups H CG 
having order ord H = p* satisfies the relation 


Proof. We write X for the set of all subsets in G that consist of precisely p* 


elements. Then 
ord X = (".) 
Pp 


and G acts on X by “left translation” via 
Gx X +X, (g,U) —+ gU = {gu; we U}. 


Different from our previous notation, we write G(U) for the G-orbit of an ele- 
ment U € X; as usual, Gy stands for the stabilizer subgroup of U in G. Viewing 
U as asubset of G, the left translation of G on itself gives rise to an action of Gy 
on U. Therefore, U consists of certain right cosets of Gy in G. These are disjoint 
and consist of ord Gy elements each. Therefore, ord Gy divides ord U = p* and 
hence is of type p*” for some k’ < k. In particular, U itself is a right coset of Gy 
if and only if ord Gy = p*. 

Now let (U;)ier be a system of elements in X representing all G-orbits of 
X. Then the orbit equation 5.1/7 yields 


We want to exploit this equation by taking equivalence classes modulo (pm). 
As we have seen, Gy, is a p-group of order p* for some kj < k. Using the 
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Theorem of Lagrange 1.2/3, this implies (G : Gy,) = p** 


V={ieTl;k, =k}, we get 


'm. Then, writing 


(ord ')-m= SG :Gn,) = (q) mod (om), 


ier" 


and it is enough for the proof of the lemma to show that ord J’ coincides with 
the number s of all p-subgroups H C G of order p*. 

To justify this assertion, recall that an index i € I belongs to I’ if and only 
if ord G(U;) = (G : Gu,) = m, hence if and only if the orbit G(U;) consists 
of precisely m elements. Now consider for a p-subgroup H C G of order p* 
the G-orbit G(H) C X; it consists of the left cosets of H in G, hence, by 
the theorem of Lagrange 1.2/3, of precisely m elements. Two different such 
subgroups H, H’ C G induce different G-orbits, since gH = H’ for some element 
g € Gimplies g € A and therefore H = H’, due to 1 € A’. On the other hand, it 
is easy to see that every G-orbit G(U;), i € I’, is of type G(#) for a p-subgroup 
H Cc G of order p*. Indeed, for i € I’ we have ord Gy, = p*, and as seen above, 
U; is a right coset of Gu, in G, say U; = Gu, + u; for some u; € U;. Then the 
G-orbit of U; in X satisfies 

G(Uj) = Gu; - Ui) = G(u;* + Gu, - ui), 
where now H = u;!- Gy, - u; is a p-subgroup in G of order p*. Therefore, the 
elements i € I’ correspond bijectively to the p-subgroups H C G of order p*, 
and the assertion of the lemma is clear. 


For a cyclic group of order n and a divisor d of n, there is always a unique 
subgroup of order d; cf. 1.3, Exercise 2 and its solution in the appendix. In this 
way, we can read from Lemma 7 the nontrivial relation 


(io) a(n) = md @ 


which leads to the following partial generalization of Corollary 4: 


Proposition 8. Let G be a finite group and p* a prime power dividing ord G. 
Then the number s of p-subgroups H C G of order p* satisfies s=1 mod (p) 
and hence is nonzero. 


In particular, choosing p* as a maximal p-power dividing ordG, we see 
that G contains at least one p-Sylow subgroup, even more specifically, that the 


number of these subgroups is congruent to 1 modulo p. 


Lemma 9. Let G be a finite group, H C G a p-subgroup, and S C G a p-Sylow 
subgroup. Then there is an element g € G such that H Cc gSg7t. 


Proof. On G/S, the set of left cosets of S in G, we consider the H-action 
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Hx G/S — G/S, (h, gS) + (hg)S, 


and apply the theorem of Lagrange 1.2/3, in conjunction with the orbit- 
stabilizer lemma 5.1/6 as well as the orbit equation 5.1/7. The order of every 
H-orbit in G/S divides the order of H and hence is a p-power, since H is a 
p-group. However, p does not divide ordG/S. Therefore, there must exist an 
H-orbit whose order is a trivial p-power p° and hence is 1. Then this H-orbit 
is a left coset gS of S, and we have hgS = gS for all h € H. Since 1 € S, this 
implies hg € gS, or h € gSg7', and therefore H C gSg™t. 


Since the map G —> G, x +> gxg™t, is an automorphism for every g € G, 
we see in the situation of Lemma 9 for every p-Sylow subgroup S in G that gSig7! 
is a p-Sylow subgroup in G as well. If H C G is another p-Sylow subgroup in 
G, an inclusion H Cc gSg7! as in Lemma 9 implies H = gSg~, due to the fact 
that ord H = ord S = ordgSg7. As a consequence, Proposition 8 and Lemma 9 
together imply the assertions of Theorem 6, except for the fact that s|ordG in 
(iii). However, this remaining part will be a consequence of the following result, 
using the theorem of Lagrange 1.2/3: 


Lemma 10. Let G be a finite group and S a p-Sylow subgroup in G. Writing 
Ng for the normalizer of S in G, the index (G : Ng) equals the number of 
p-Sylow subgroups in G. 


Proof. Let X be the set of p-Sylow subgroups in G. Since all p-Sylow subgroups 
are conjugate in G, the conjugation action 


Gx X +X, (g, 8") 3 gS", 
is transitive. In particular, the orbit-stabilizer lemma 5.1/6 yields 
ord X = (G: Gs), 


where Gg, the stabilizer group with respect to the conjugation action, coincides 
with the normalizer Ng. 


Thus, summing up, the proof of Theorem 6 and hence of the Sylow theorems 
is now complete. We want to draw some consequences from these results. 


Corollary 11. Let G be a finite group and p a prime number. Then: 
(i) If plordG, then G admits an element of order p. 
(ii) G is a p-group if and only if for every a € G there exists an exponent 
t EN such that a” = 1. 
(iii) A subgroup H CG is a p-Sylow subgroup if and only if it is a maximal 
p-group in G. 


Proof. Assertion (i) follows from Proposition 8, or alternatively, from Theo- 
rem 6 (i), in conjunction with Corollary 4. 
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To verify (ii), assume that every element a € G admits a p-power as its 
order. If ordG is not a p-power, choose a prime number q different from p that 
divides ord G. Then, as we have seen, G will contain an element of order q, in 
contradiction to our assumption. Therefore, ordG is a p-power, and hence G 
a p-group. Conversely, if G is a p-group, the order of any element a € G isa 
p-power, since orda divides ord G by the theorem of Lagrange 1.2/3. 

Finally, we conclude from 1.2/3 again that every p-Sylow subgroup of G is 
a maximal p-subgroup. The converse of this follows from Theorem 6 (i), so that 
assertion (iii) is clear, too. 


Proposition 12. Let p,q be prime numbers such that p < q and p{(q—1). 
Then every group G of order pq is cyclic. 


Proof. Let s be the number of p-Sylow subgroups in G. Then, by Theorem 6 (iii), 
we have s|ordG, ie., s| pq, as well as s = 1(p). This implies p{s and hence 
s|q. Since q = s = 1(p) is excluded by the condition p{(q— 1), we must have 
s = 1. Hence, there is precisely one p-Sylow subgroup S, in G. It is invariant 
under conjugation with elements of G and therefore normal in G. Likewise, if 
s’ is the number of g-Sylow groups in G, we conclude that s’|p. Again, the case 
s’ = p is excluded, since p = s’ = 1(q) is not compatible with p < q. Therefore, 
we must have s’ = 1, and there is a unique g-Sylow subgroup S, in G, which, as 
before, is normal in G. Since 5, and S, do not admit proper subgroups except 
for the trivial group {1}, we see that 5, S, = {1}. 
Now we claim that the map 


YP: Sp X Sq —> G, (a,b) +> ab, 


is an isomorphism of groups. Knowing this, we can conclude, for example by the 
Chinese remainder theorem in the version of 2.4/14, that G, being the Cartesian 
product of two cyclic groups of relatively prime orders, is itself cyclic. To show 
that y is indeed a group homomorphism, choose elements a € S,, b € S, and 
observe that 

aba~'b~! = (aba~')b7! € Sy, 
as well as 

aba~'b-* = a(ba"b*) € Sp. 
Therefore, 

aba~'b~! € S,M Sy = {1} 

and hence ab = ba, showing that the elements of 5; commute with those of Sq. 
In particular, for a,a’ € S,, and 6, b’ € S, we can write 


y((a,b) - (a’,')) = yaa’, bb’) = aa'bb' 
= aba’b! = y(a, b) : g(a’, b), 


which implies that y is a group homomorphism. Since 5,5, = {1}, it follows 
that vy is injective, and even bijective, since the orders of S, x Sg and G' coincide. 
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Exercises 
1. Review the Sylow theorems and give an outline of the information they provide 
for finite abelian groups. 


2. Let p: G —>+ G" be a homomorphism between finite groups. Try to relate the 
Sylow subgroups of G to those of G’. 


3. Let G be a finite group and H C G a p-subgroup, for some prime number p. If 
# is normal in G, show that H is contained in every p-Sylow subgroup of G. 


4. Let GL(n, K) be the group of all invertible (n x n) matrices over a finite field K 
of characteristic p > 0. Show that the upper triangular matrices with diagonal 
elements equal to 1 give rise to a p-Sylow subgroup of GL(n, K). 


5. Show that every group of order 30 or 56 admits a nontrivial Sylow subgroup that 
is normal. 


6. Show that every group of order 45 is abelian. 


7. Show that every group G of order 36 admits a nontrivial normal subgroup. Hint: 
Consider the action of G on the set of 3-Sylow subgroups of G. 


8. Show that every group G of order ordG < 60 is cyclic or admits a nontrivial 
normal subgroup. 


5.3 Permutation Groups 


In the following we want to have a closer look at the group G,, of bijective 
self-maps of {1,...,n}. As we know already, G, is called the symmetric group 
or the permutation group of {1,...,n}. This group acts naturally on {1,...,n} 
and satisfies ord G,, = n!. Elements 7 € G, are frequently written in the form 


in particular when the images 7(1),...,7(m) are given by explicit expressions. 
A permutation 7 € G,, is called a cycle if there are distinct elements x,..., 2, 
in {1,...,n}, r > 2, such that 


1 (Xj) = Ti+ for 1 < L< Yr, 


m(z)=a2 for ve {l,...,n}—{a1,...,z,}. 


More precisely, in such a situation 7 is called an r-cycle, and one uses the 
notation 7 = (21,...,2,). Two cycles (x1,...,7,) and (y,...,Yys) are called 
disjoint if 

{x1,...,t}A{m,.--, ys} =O. 


A 2-cycle is called a transposition. 
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Proposition 1. Let n > 2. 
(i) If m,m2 € Gy are disjoint cycles, then m0 12 = 7207}. 
(ii) Every permutation 7 € G,, can be written as a product of disjoint cycles. 
These are uniquely determined by 7, up to ordering. 
(iii) Every permutation 7 € G,, is a product of transpositions. 


Proof. Assertion (i) is trivial. In the situation of (ii) we write H = (7) for the 
cyclic subgroup generated by 7 in G,,. Then consider the natural action of H 
on {1,...,n} and look at the corresponding partition of {1,...,} into disjoint 
H-orbits. Let B,,..., Be be the orbits that contain at least two elements, i.e., 
that satisfy r, = ord B, > 2. Choosing a point 7, € By for each A = 1,..., 2, 
we get 

By = {ay, T(X), sie gee aes 


and 
L 
T= Wee T(Xy),+-- sw *(ay)}, 
N=1 


hence a factorization of a into disjoint cycles, where the ordering of factors 
does not matter, due to (i). On the other hand, every factorization of 7 into 
a product of disjoint cycles corresponds, in the manner as explained before, to 
the decomposition of {1,...,n} into its H-orbits. This settles the uniqueness 
assertion. 

Finally, assertion (iii) follows from (ii) using the factorization 


(1, ee 7) = (x1, £2) 7 (12,23) Des os@ (Tp; Bp) 


Given a permutation 7 € G,, one defines its sign or signature by 


sgn 7 = II ay) = m9) : 
ip = A 
It is clear that the sign can assume only the values 1 and —1. If sgn 7 is positive, 
m is called an even permutation, and an odd permutation otherwise. The defi- 
nition of the sign of a permutation counts, so to speak, in a multiplicative way 
modulo 2, the number of two-element subsets {7,7} C {1,...,n} for which the 
map 7 reverses the size relationship between i and j. In particular, the above 
product may alternatively be executed over any index set J of pairs (i,7/) of 
integers 1 < i,7 <n such that the mapping (7,7) +> {7,7} sets up a bijection 
between J and the set of two-element subsets of {1,...,n}. 


Remark 2. The map sgn: 6, —> {1,—1} ts a group homomorphism. 


Proof. Let 7,2’ € Gy. Then 
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ee ee | eee een 


i<j tJ 
_ mom (i) — 10 m'(4) (i) — m'(j) 
“Uae 


=senm-senz’. 


Clearly, the sign of a transposition in G, equals —1. Thus, if we factorize a 
permutation 7 € G,, according to Proposition 1 (iii) into a product of transpo- 
sitions, say 7 = T,0...07, we get sena = (—1)*. It follows that the residue 
class of € modulo 2 is uniquely determined by 7. In particular, 7 is an even or 
an odd permutation depending on whether z is a product of an even or an odd 
number of transpositions. Furthermore, we can conclude from Remark 2 that 


2, = kersgn = {a €6,; sgn7 = 1}, 


i.e., the set of all even permutations, defines a normal subgroup of index 2 in 
G,, provided n > 1. The group 2, is called the alternating group on {1,...,n}. 


Proposition 3. For n > 3, the group 2, consists of all permutations 7 € Gy, 
that can be written as a product of 3-cycles. 


Proof. Consider elements 21, %2,%3,%4 € {1,...,n}. If 71, %9,23 are distinct, 
then the following formula holds: 


(21, 2) 2 (x2, t3) = (11, £2, @3). 
Furthermore, if 71, %2, 23,24 are distinct, then 
(x1, £2) ° (x3, £4) — (x1, £3, £2) 2 (21,09, a). 


The first equation shows that every 3-cycle belongs to 2,, a fact that extends 
to products of 3-cycles as well. Both equations together imply that the product 
of an even number of transpositions, in particular every element of 2,,, is a 
product of 3-cycles. 


To end this section, let us look at some special permutation groups, as well 
as at some subgroups of these. 


(1) The group Gy is of order 2, so that Gg ~ Z/2Z. 


(2) The group 63 is of order 6 and can be viewed as the dihedral group D3, 
i.e., as the group of rotations and reflections of an equilateral triangle (three 
rotations, three reflections). Since G3 contains elements of order 1, 2, and 3, 
but not of order 6, we see that it cannot be isomorphic to the cyclic group 
Z/6Z. On the other hand, every abelian group of order 6 is isomorphic to the 
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product Z/2Z x Z/3Z and therefore is cyclic, due to the Chinese remainder the- 
orem 2.4/14. As a consequence, 63 is nonabelian of order 6, which alternatively, 
can just as well be verified by direct inspection. 

(3) More generally, the dihedral group D,, can be defined for indices n > 3 as 
the group of symmetries (rotations and reflections) of a regular (convex) n-gon. 


Numbering the corners of the n-gon consecutively by 1,...,n, we can view D,, 
as the subgroup of G, that is generated by the permutations 
1 2 3 Bee, Tu 
= (leaesn); r= (; ore ee ae 


The permutation o corresponds to the rotation of the regular n-gon by the angle 
2n/n, and 7 to the reflection across the axis passing through the point 1. It is 
easy to see that D, is of order 2n, and that o generates a cyclic subgroup of 
index 2 in D,. In a similar way one defines symmetry groups for regular convex 
polyhedra, i.e., for tetrahedra, cubes, octahedra, dodecahedra, and icosahedra. 


(4) For later use we want to introduce the Klein four-group G4 as a subgroup 
of G4: 
M, = {id, (1,2)3,4, 0,3)2,4), 0,4)(2,3)}. 
Then 
Wy C Ay C G4, 
and it is easily checked, see Exercise 6, that U4 is a normal subgroup of G4. 
Also note that %4 is isomorphic to Z/2Z x Z/2Z. 


Exercises 


1. Transpositions in Gy, are permutations that exchange two elements of the set 
{1,...,n}, leaving everything else fixed. Give a direct argument for n > 2, showing 
that every 7 € Gy is a product of transpositions. 

2. Given a prime number p, specify a p-Sylow subgroup in Gp. 

r—1 


3. For an r-cycle 7 € G,, show that sgna = (—1) 


4. Consider a permutation 7 € G, and write (7) C G,, for the corresponding cyclic 
subgroup generated by 7. Let m be the number of (7)-orbits with respect to the 
natural action of (7) on {1,...n}. Show that sgna = (—1)"~"™. 


5. Write the following permutations as products of cycles and compute the sign: 
1234 123 45 67 8 
(G41 26% Chere ee 
6. Consider an r-cycle 7 = (x1,...,2,) € Gy and show for arbitrary 0 € G, that 
genoa += (e(x)),...,e(@,)). 
As an application prove that the Klein four-group is a normal subgroup of G4. 


7. Show for n > 2 that the cycles (1,2) and (1,2,...,7) generate the group Gy. 


8. Show for n > 3 that 2,, is generated by the cycles (1,2,3), (1,2,4),...,(1,2,7). 
Conclude from this that a normal subgroup N C 2, coincides with 2, as soon 
as it contains a 3-cycle. 
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In order to characterize solvable groups we need the notion of commutator. For 
two elements a,b of a group G, one calls [a,b] = aba~'b~! the commutator of 
a and b. In a similar way, we define the commutator [H, H’] of two subgroups 
H, H' Cc G as the subgroup that is generated in G by all commutators [a,b] 
for elements a € H, b € H’. In particular, taking H = H’ = G, we obtain the 
commutator group [G,G] of G. Let us point out that a group G is abelian if 
and only if [G,G] = {1}. 


Remark 1. (i) [G,G] consists of all (finite) products of commutators in G. 
(ii) [G,G] is a normal subgroup in G, the smallest of all normal subgroups 


N CG such that G/N is abelian. 


Proof. For a,b € G we have 


(a,b) = (aba*b)* = baba = [ba], 


Therefore, the finite products of commutators form a subgroup in G, the com- 
mutator group [G,G]. Furthermore, for a, b,g € G we can write 


gla, b]g~* = gaba~*b~*g~* = (gag~')(gbg~*)(gag™")*(gbg™")~* 
= [gag™', gbg™"). 


Hence, [G,G] is a normal subgroup in G. Now write % for residue classes in 
G/[G,G] of elements x € G. Then the equation 


a-b-a7-b =obab =1 


shows that G/[G,G] is abelian. On the other hand, if N C G is a normal 
subgroup such that G/N is abelian, then N must contain all commutators [a, b] 
of elements a,b € G. But this implies [G,G] C N, and it follows that [G, G] 
is the smallest of all normal subgroups N C G such that G/N is abelian. 


For later use, let us determine some special commutator groups. 


Remark 2. The permutation groups Gn, An satisfy 


[Gn,Gn] = 2, for n > 2, 
{1} for n = 2,3, 

(Me; An)| = 4 By forn=—4, (Klein four-group) 
2, forn>5. 


Proof. We start with the computation of [G,,G6,]. Since G,/2, ~ Z/2Z is 
abelian, we get [Gn,6n] C &%, by Remark 1 and in particular [G2, G2] = A, 
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since %_ = {1}. To justify the inclusion [G,,6,] D 2, for n > 3, we use the 
fact that every element of 2, is a product of 3-cycles; cf. 5.3/3. However, every 
3-cycle (x1, %2, 73) € Gy is a commutator, since we can write 


(©1, 22,03) = (21, 23)(Lo, £3)(L1, 23) (T2,3)*. 


Thus, we get [G,,6,] D> %, and hence [Gyr,G,] = An. 

Next observe that the groups 2l2 and 23 are of order 1, resp. 3, and therefore 
are abelian. As a consequence, the commutator group [2,,%n] is trivial for 
n = 2,3. Furthermore, for n > 5, consider a 3-cycle (x1, #2, 23) in G,. Choosing 
U4,U5 € {1,...,n} such that x,,...,x5 are distinct, we get 


(1, 2,23) = (@1, F2,€4)(1, 3, Bs) (21, La, 4)" (21, 23,05). 


Since 2, consists of all finite products of 3-cycles by 5.3/3, it follows that every 
element of 2,, is a product of commutators in 2,,. This implies 2, C [2,, Xn] 
and hence [n,n] = An. 

It remains to justify the relation [2l4, 2%4] = Wy. According to Remark 1 (ii), 
we know that [24,24] equals the smallest normal subgroup N C 4 such 
that the factor group 2%4/N is abelian. Since 24,/%, is of order 3 and hence 
abelian, we conclude that [24,24] C Wy. On the other hand, distinct elements 
%1,..-.,%4 € {1,...,4} lead to the equation 


(11, %2)(%3,24) = (x1, €2, %3)(21, £2, £4)(21, Lo, %3)" (1, La, £4), 


showing that UW, = {id, (1,2)(3,4), (1,3)(2,4), (1,4)(2,3)} is contained in 
[24 , Aa]. 


In the following we want to use commutators in order to characterize so- 
called solvable groups. To do this we introduce for a group G and an integer 
i EN its ith iterated commutator D'G inductively by 


DG=G, |e =(p'E, pc): 


In this way, we obtain a chain 


G=D°Go D'G2D...D DGD... 


of subgroups in G, where in each case, D‘t!G is normal in D'G. In addition, 
D'G/D**'G is abelian. More specifically, chains of the latter type are used to 
define solvable groups: 


Definition 3. Let G be a group. A descending chain of subgroups 
G=G)2G,D...DG,= {1} 


is called a normal series of G if in each case, Gj, is a normal subgroup of G;. 
The residue class groups G;/Gisi1, i= 0,...,n—1, are called the factors of the 
normal series. 
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The group G is called solvable if G admits a normal series with abelian 
factors. 


Proposition 4. A group G is solvable if and only if there is an integer n € N 
such that D"G = {1}. 


Proof. First, assume that G is solvable and let 
G=G2G,D...DG,= {Il} 


be a normal series with abelian factors. We show by induction that D'G Cc G; 
for i = 0,...,n. For 7 = 0 the inclusion holds for trivial reasons. Now assume 
that D'G C G; for some i < n. Since G;/Gi41 is abelian, we get [Gi,Gi] C Giy1 
by Remark 1 (ii) and hence 


D*'G = [D'G, D'G] c [Gi, Gi] C Gi, 
as desired. In particular, it follows that 
D°G CG, = {1}. 
Conversely, if DG = {1}, then 
G=DG>D D'GD...D D"G = {1} 


is a normal series with abelian factors. 


Let us consider some examples. Note that every commutative group is solv- 
able for trivial reasons. 


Remark 5. The symmetric group 6, is solvable for n < 4, but not solvable if 
n> 5. 


Proof. There are the following normal series of G,, for n < 4: 


6, > {l}, 
63 D As D {1}, 
G4 Dd Ay D Wy D {1}. 


That the factors of these normal series are abelian is easy to see. The groups Ge, 
63/23, and 64/4 are cyclic of order 2, whereas the groups 23 and Wl4/Wy are 
cyclic of order 3, so that the commutativity is clear in these cases. Furthermore, 
the Klein four-group U4 is commutative as well. Therefore, G, is solvable for 
n < 4. For n > 5 we have [G,,G6,] = %, as well as [%,,%,] = Wn, see 
Remark 2, so that G, cannot be solvable in this case. 


Remark 6. For a prime number p, every (finite) p-group, hence every group 
of order p" for some n EN, is solvable. 
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The assertion was already proved in 5.2/4. Next we want to derive a special 
characterization of finite solvable groups that will be of interest later when we 
study the solvability of algebraic equations. 


Proposition 7. Let G be a finite solvable group. Then every strictly decreasing 
normal series in G with abelian factors can be refined to a normal series whose 
factors are cyclic of prime order. 


Proof. Let G = Go D ... D Gy be a strictly decreasing normal series with 
abelian factors. If one of the factors, say G;/Gj41, is not cyclic of prime order, 
choose a nontrivial element @ € G;/Gj41, where replacing @ by a suitable power 
of itself, we may assume that ord@ is prime. Since G;/Gi41 is not cyclic, we 
see that (a), the cyclic group generated by @, is strictly contained in G;/Gi41. 
Therefore, its preimage with respect to the projection G; —> G;/Gi41 yields a 
subgroup H C Gj satisfying 


G; 2 H 2 Gist. 


Since (@) is a normal subgroup of the (abelian) group G;/Gi41, it follows that 
its preimage H is a normal subgroup in G;. Since G;,; is normal in G; and 
thus also in H, we may refine the normal series Gp D ... D G,, by inserting H 
between G; and G41, thereby obtaining a new normal series of G. It has abelian 
factors as well, due to the injection H/Gi+; @ G;/Gi4; and the epimorphism 
Gi/Gis1 — G;/H, where G;/Gj41 is abelian. Repeating the refining process if 
necessary and using the finiteness of G, we arrive after finitely many steps at a 
normal series whose factors are cyclic of prime order. 


Proposition 8. Let G be a group and H CG a subgroup. If G is solvable, the 
same is true for H as well. If H is normal in G, then G is solvable if and only 
if H and G/H are solvable. 


Proof. First, assume that G is solvable. Then, since D'H C D'G, we sce that 
HT is solvable as well. Furthermore, if H is normal in G, we can consider the 
canonical epimorphism 7: G —> G/H. Since D*((G)) = 7(D'(G)), as is easily 
verified, we see that G/H = 7(G) is solvable if G is solvable. 

Conversely, assume that H and G/H are solvable, say D°H = {1} and 
D"(G/H) = {1}. Then we get 


m(D"G) = D"(G/H) = {1}, 
i.e., DG C H, and furthermore, D?"G C D"H = {1}. Therefore G is solvable. 


Corollary 9. The Cartesian product [[;_, Gi of finitely many groups Gy,...,Gn 
is solvable if and only if all groups G; are solvable. 


Proof. Conclude by induction, and for n = 2, apply Proposition 8 to the pro- 
jection Gy x Gy —> G2, which admits G; as its kernel. 
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Exercises 


ile 


We have seen in Remark 1 for a group G that its commutator group [G,G] equals 
the smallest of all normal subgroups N C G such that the factor group G/N is 
abelian. Prove a corresponding assertion for commutators of type [G,H], where 
HT is a normal subgroup, or alternatively, any subgroup in G. 


Let p,q be distinct prime numbers. Show that every group of order pq is solvable. 
For a finite group G show: 


(i) If H, H’ are normal subgroups in G that are solvable, then H - H’ is solvable 
as well. 


(ii) There exists a unique largest subgroup in G that is solvable. This subgroup 
is invariant under all automorphisms of G. 


Show that every group of order < 60 is solvable. 
Show that the alternating group 25 does not admit a nontrivial normal subgroup. 


Let T be the subgroup of upper triangular matrices in GL(n, K), the group of 
invertible (n x m) matrices over a field K. Show that T is solvable. 


. For a group G consider the subgroups C’(G) that are inductively given by 


C1(G) = G and C**!(G) = [G,C"(G)]. The group G is called nilpotent if there is 
some n € N such that C"(G) = {1}. Show that every nilpotent group is solvable. 


. In the situation of Exercise 6, consider for kK #¢ F2 the group of all upper trian- 


gular matrices T C GL(n, K), as well as its subgroup 7, C T consisting of all 
triangular matrices with diagonal elements equal to 1. Show that 7] is nilpotent 
(cf. Exercise 7), but that Tis not. Therefore, T is an example of a solvable group 
that is not nilpotent. 


6. Applications of Galois Theory ®) 


Check for 
updates 


Background and Overview 


In the previous chapters we have developed group theory, field theory, and in 
particular, Galois theory to a sufficiently high level, allowing us to consider 
applications to some prominent classical problems. We start in 6.1 with the 
problem of solving algebraic equations by radicals, which is the problem that 
motivated E. Galois to work out his “Galois” theory. In particular, we show for 
a monic separable polynomial f with coefficients in a field K that the algebraic 
equation f(a) = 0 is solvable by radicals if and only if the corresponding Galois 
group is solvable in the group-theoretic sense. 

The basic idea of proof is easy to explain. On the side of fields, the problem 
is reduced to a setting in which K contains sufficiently many roots of unity. 
Then one repeatedly extends K by adjoining suitable radicals, i.e., zeros of 
polynomials of type X" —c € K[X] for char K{n, and if p = char K > 0, also 
of type X? — X —c € K[X]. These extensions represent essentially all cyclic 
Galois extensions of K; see 4.8/3 and 4.8/5. In a similar way, we use on the 
level of Galois groups the fact that the finite cyclic groups are, so to speak, the 
“building blocks” of the finite solvable groups; cf. 5.4/7. Furthermore, let us 
point out that it is common practice to view the zeros of polynomials of type 
X? — X —c€ K[X] for p= char K > 0 as “radicals” as well. Only in this way 
does the characterization of solvable (separable) algebraic equations in terms of 
solvable Galois groups remain valid for fields of characteristic > 0. Of course, 
polynomials of type X? — c for p = char K > 0 are not separable, which means 
that their zeros cannot be studied using methods of Galois theory. Finally, in 
6.1/10 we give a necessary condition for an irreducible algebraic equation of 
prime degree to be solvable. The latter criterion, which goes back to E. Galois 
as well, can be used to easily set up examples of algebraic equations that are not 
solvable. To further illustrate the solvability problem we work out in Section 6.2 
the explicit formulas for the solutions of algebraic equations of degrees 3 and 4. 

As a second application, we present in 6.3 a Galois-theoretic proof of the 
fundamental theorem of algebra. From an algebraic point of view, working on 
this theorem can be quite tricky, as is clearly visible from first proofs. The 
difficulties are caused by the fact that the field C of complex numbers relies on 
the field of real numbers R, by adjoining a square root of —1. Constructing the 
field R, however, requires methods from analysis. Therefore, given a polynomial 
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f € CLX], chances are very low that one can realize its zeros in an algebraic 
way as elements of C. To avoid such difficulties, we proceed indirectly. If C were 
not algebraically closed, one could use Kronecker’s construction to arrive at a 
nontrivial extension L/C, which can be assumed to be Galois. Then we use a 
first (analytic) fact, namely that real polynomials of odd degree admit at least 
one real zero, and show in terms of Galois theory that we may assume L/C to be 
of degree 2. However, such an extension cannot exist. This becomes clear if we 
use as a second (analytic) fact that positive real numbers admit a square root 
in R, and hence all complex numbers admit a square root in C. In particular, 
our proof depends on the mentioned “analytic” properties of the real numbers. 

In 6.4 we discuss another application, compass and straightedge construc- 
tions in the complex plane. A thorough analysis of the construction steps that 
are possible in such a setting shows that starting with the points 0,1 € C, one 
can construct only points z € C that are contained in Galois extensions L/Q 
whose degree [L : Q] is a power of 2. In particular, z is then algebraic over Q, 
of a degree that is a power of 2. In this way, the constructibility of the cube 
root W2 is excluded, and it follows, for example, that the ancient problem of 
doubling the cube is not accessible in terms of compass and straightedge con- 
structions. Another topic we elaborate on is the study of C. F. Gauss on the 
constructibility of regular convex polygons. 


6.1 Solvability of Algebraic Equations 


Even if the solution formulas for algebraic equations in degrees 1 and 2 look quite 
simple, the more complicated formulas in degrees 3 and 4, which we will derive in 
Section 6.2, make it undoubtedly clear that the problem of solving more general 
algebraic equations is not so easy. In fact, we will see starting from degree 5 on 
that as a matter of principle, universal solution formulas for algebraic equations 
cannot exist. In order to look at the corresponding background more closely, let 
us make the notion of solvability of algebraic equations precise. 


Definition 1. A finite field extension L/K is said to be solvable by radicals if 
L admits an extension field E together with a chain of field extensions 


K=E,CE,C...CEn=E 


such that in each case, E;4, is obtained from E; by adjoining an element of the 
following type: 


(1) a root of unity, or 
(2) a zero of a polynomial of type X" —a € E;|X], where char K{n, or 
(3) a zero of a polynomial of type XP — X —a € E;[X] for p = char K > 0. 


Then L/K is necessarily separable. 
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The main goal of the present section is to characterize solvability by radicals 
in terms of the solvability of Galois groups in the group-theoretic sense. 


Definition 2. A finite field extension L/K is called solvable if there exists an 
extension field E D> L such that E/K is a finite Galois extension with solvable 
Galois group Gal(E’/K) (in the sense of 5.4/3). 


Using this definition, observe that a Galois extension L/K is solvable if and 
only if the Galois group Gal(L/K) is solvable. Indeed, if we can enlarge L/K 
to a finite Galois extension E’/K with solvable Galois group, then Gal(L/K’) is 
a quotient of Gal(E’/K) by 4.1/2 and thus solvable by 5.4/8. 

The two notions of solvability extend naturally to the context of algebraic 
equations. If f is a nonconstant (separable) polynomial with coefficients in a 
field kK, we can choose a splitting field L of f over K. Then we say that the 
algebraic equation f(x) = 0 is solvable over K, resp. solvable by radicals, if the 
extension L/K admits the corresponding property. 

We want to prove some more or less elementary properties of the two solv- 
ability notions. 


Lemma 3. Let L/K be a finite field extension and F an arbitrary extension field 
of K. Embed L via a K-homomorphism into an algebraic closure F of F, see 
3.4/9, and look at the composite field FL in F. Then, if L/K is solvable (resp. 
Galois with solvable Galois group, resp. solvable by radicals, resp. exhaustible by 
a chain of field extensions as in Definition 1), then the same is true for the 
extension FL/F as well. 


Lemma 4. Given a chain of finite field extensions K C LC M, the extension 
M/K is solvable (resp. solvable by radicals) if and only if M/L and L/K are 
solvable (resp. solvable by radicals). 


Proof of Lemma 3. First assume that L/K is solvable. Enlarging L, we may 
restrict ourselves to the case that L/K is Galois with solvable Galois group 
Gal(L/K). Then FL = F(L) is a finite Galois extension of F’. Since every 
a € Gal(F'L/F) leaves the field K fixed, it follows that o(L) is algebraic over K. 
In particular, we conclude from 3.5/4 that there is a restriction homomorphism 


Gal(FL/F) — Gal(L/K). 


This homomorphism is injective, since FL = F(L). Therefore, the solvability 
of Gal(FL/F) and hence of F'L/F follows from 5.4/8. On the other hand, if 
L/K is solvable by radicals, resp. exhaustible by a chain of field extensions as 
in Definition 1, then the same is true by trivial reasons for the extension F'L/F. 


Proof of Lemma 4. Again we start by considering the property “solvable.” As- 
sume first that M/K is solvable. Enlarging M, we may assume M/K to be 
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Galois with solvable Galois group. Then, by definition, L/K is solvable, too. 
Furthermore, since Gal(//L) can be viewed as a subgroup of Gal(M/K’), we 
conclude from 5.4/8 that M/Z is solvable. 

Now assume that the extensions M/Z and L/K of the chain K Cc LC M 
are solvable. In a first step we show that both extensions can be assumed to 
be Galois with solvable Galois group. To achieve this, choose a finite extension 
L’ of LE such that L’/K is Galois with solvable Galois group. Then we can use 
Lemma 3 and thereby replace L by L’ as well as M by the composite field L'M 
(in an algebraic closure of M). Furthermore, there exists a finite extension M’ of 
L'M such that M’'/L’ is Galois with solvable Galois group. Now, replacing L’ M 
by M’, we may assume in the following that both M/Z and L/K are Galois 
with solvable Galois group. 

Since M is separable, but not necessarily Galois over K’, we pass to a normal 
closure M’' of M/K, see 3.5/7, where now M'/K is a finite Galois extension. To 
construct M’ we consider all K-homomorphisms 0: M —+ M into an algebraic 
closure M of M and define M’ as the composite field of all fields c(M). Since 
L/K is Galois, we get o(L) = L for all o, and it follows that every extension 
o(M)/L is a Galois extension that is isomorphic to M/L. Now we claim that 
the Galois group Gal(M'/K’) and hence the extension M/K are solvable. To 
justify this, look at the surjective restriction homomorphism 


Gal(M’/K) —> Gal(L/K) 
admitting Gal(M'/L) as its kernel; cf. 4.1/2 (ii). Since Gal(L/X) is solvable, it is 
enough by 5.4/8 to show that Gal(M’/L) is solvable. However, using 4.1/12 (ii), 
the latter group can be viewed as a subgroup of the Cartesian product 


[[] — Gal(e(n)/Z). 
o€Hom x (M,M) 
All groups Gal(o(M)/L) = Gal(o(M)/o(L)) are canonically isomorphic to 
Gal(M/L) and hence are solvable. By 5.4/9, the Cartesian product of these 
groups is solvable as well, and it follows from 5.4/8 that Gal(M’/L) is solvable. 
This finishes the proof of Lemma 4 for the property “solvable.” 

It remains to look at the property “solvable by radicals.” If //K is solvable 
by radicals, the same clearly holds for the extensions M/Z and L/K as well. 
Conversely, if M/L and L/K are solvable by radicals, choose an extension L’/L 
such that L’/K can be exhausted by a chain of field extensions as specified in 
Definition 1. Then consider the composite field L’M in an algebraic closure of 
M and use the fact that L'M/L’ is solvable by radicals due to Lemma 3. It 
follows that L’M/K is solvable by radicals and that the same is true for M/K. 


Theorem 5. A finite field extension L/K is solvable if and only if it is solvable 
by radicals. 


Proof. Assume first that L/K is solvable. Enlarging ZL, we may assume that 
L/K is Galois with solvable Galois group. Let m be the product of all prime 
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numbers q # char K dividing the degree [L : K]. Furthermore, let F be an 
extension field of K, obtained by adjoining a primitive mth root of unity. Then, 
by definition, the extension F'/K is solvable by radicals. Now look at the chain 
of fields 

KCFCFLI, 


where the composite field F'L is constructed in an algebraic closure of K. It is 
enough to show (see Lemma 4) that FL/F is solvable by radicals. To do this, 
we know from Lemma 3 that F'L/F is solvable, even a Galois extension with 
solvable Galois group, since the corresponding property was assumed for the 
extension L/ I<. Now choose a normal series 


Gal(FL/F) =Go > G1 D... > G, = {1} 


with factors that are cyclic of prime order; cf. 5.4/7. By the fundamental theo- 
rem of Galois theory 4.1/6, this corresponds to a chain of field extensions 


Pehene...0 Pe hL 


such that in each case, F;;,/F; is a cyclic Galois extension of prime order, 
say of some order p;. Observing that [FL : F'] divides [L : K], for example, 
using 4.1/12 (i), we see for p; 4 char K that the prime p; will divide m. In 
particular, F' and hence F; contain a primitive p;th root of unity. Therefore, we 
can apply 4.8/3 to conclude that Fj,, is obtained from F; by adjoining a zero 
of a polynomial of type X?' — a € F;|X]. On the other hand, if p; = char K, 
we see from 4.8/5 that Fj,, is constructed from F; by adjoining a zero of a 
polynomial of type X?' — X —a € F;[ X]. In either case, it follows that F'L/F 
and hence L/K are solvable by radicals. 

To verify the reverse implication, assume that L/K is solvable by radicals. 
Then there exists a chain of field extensions K = Ky C Ky, Cc... C K,, such 
that ZL C K,,, and in each case, the extension Kj41/K; is of type (1), (2), or 
(3) in the sense of Definition 1. Enlarging L, we may assume L = K,,. To 
prove that L/K is solvable it is enough by Lemma 4 to show that all extensions 
Kj41/K; are solvable. In other words, we may assume that the extension L/K 
is of type (1), (2), or (3) in Definition 1. Now observe that the extensions of 
type (1) and (3) are Galois, in fact abelian by 4.5/7 for type (1) and cyclic by 
4.8/5 for type (3). In particular, in both cases L/K is solvable. In the remaining 
case, in which L/K is of type (2), the field L is obtained from K by adjoining 
a zero of a polynomial X" — c € K[X], for some exponent n not divisible by 
char kK. To handle this case, consider an extension F'/K that is generated by 
a primitive nth root of unity. Then we can consider the chain kK C FC FL, 
where the composite field FL is constructed in an algebraic closure of L. We 
know from 4.5/7 that F'/K is an abelian Galois extension, whereas F'L/F is a 
cyclic Galois extension by 4.8/3. In particular, both extensions are solvable, and 
we see from Lemma 4 that F'L/K and hence L/K are solvable as well. Thus, 
we are done. 
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Corollary 6. Let L/K be a separable field extension of degree < 4. Then L/K 
is solvable, and in particular, solvable by radicals. 


Proof. By the primitive element theorem 3.6/12, the extension L/K is simple, 
say L = K(a). Let f € K[X] be the minimal polynomial of a over K and let 
L' be a splitting field of f over K. Then deg f = [L: Kk] < 4, and the Galois 
group Gal(L’/K) can be viewed as a subgroup of G4, due to 4.3/1. Since Ga, 
and therefore all its subgroups are solvable (see 5.4/5 and 5.4/8), we conclude 
that L’/Kk and L/K are solvable. 


Corollary 7. There exist finite separable field extensions that are not solvable 
by radicals. For example, the generic equation of degree n is not solvable by 
radicals for n > 5. 


Proof. It is enough to know that the generic equation of degree n admits the full 
permutation group G,, as its Galois group for n > 2; cf. Section 4.3, Example (4). 
Since G,, is not solvable for n > 5 by 5.4/5, we see from Theorem 5 that the 
corresponding extension L/K cannot be solvable by radicals in this case. 


Let us review once more Example (4) of Section 4.3. Starting out from a field 
k, we have considered the rational function field L = k(T,...,T7;,) in a number 
of variables 7,,...,7;,. The permutation group G, acts on L by permuting 
the T;, and L was recognized as a Galois extension of the corresponding fixed 
field K, with Galois group Gal(L/K) = G,. The fixed field Kv itself could be 
identified with K = k(s1,...,5n), where s1,..., 8, are the elementary symmetric 
polynomials in T;,...,7;,. In fact, we have seen that L is a splitting field of 
the polynomial f = X” — 5,X""'+...+(—1)"s, € K[X]. Since the elements 
$1,---,5n © K are algebraically independent over k by the fundamental theorem 
on symmetric polynomials 4.3/5, or 4.4/1, the coefficients —s,,...,(—1)"s, can 
just as well be viewed as variables over k. In this way, we can state for n > 5 and 
variables c,...,C, over k that a” +cya"-t+...+¢, =0, the generic equation 
of degree n over the rational function field K = k(ci,...,¢,), is not solvable by 
radicals. 

More specifically, we can ask whether there exist algebraic equations that 
are not solvable by radicals, even over the field Q of rational numbers. This 
question will be studied in the following, restricting ourselves to equations of 
prime degree, however. We start with two auxiliary results on permutations that 
subsequently will be applied to Galois groups. 


Lemma 8. For a prime number p, let GC G, be a subgroup acting transitively 
on {1,...,p}. Then G contains a subgroup H of order p. If G is solvable, H is 
uniquely determined and in particular normal in G. 


Proof. Since G acts transitively on {1,...,p}, there exists only a single G-orbit. 
It consists of p elements, and we see, for example using 5.1/6, that p divides 
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ordG. Since p? does not divide the order of G,, which is p!, we can exclude 
that p? divides ord G. Therefore, G contains a subgroup HA of order p, namely 
a p-Sylow subgroup; see 5.2/6. 

Assuming now that G is solvable, it is seen from 5.4/7 that G admits a 
normal series G = Gp 2... D Gn = {1} whose factors are cyclic of prime order. 
We want to show by induction that each G; acts transitively on {1,...,p} for 
i <n. This being clear for i = 0, let i > 0. Since G; is normal in G;_,, we can 
look at the relation g(Gjx) = Gi(gx) for elements g € G;_; and # € {1,...,p}. 
This shows that G;_; acts on the G;-orbits in {1,...,p}. Hence, since G;_, acts 
transitively on {1,...,p} by the induction hypothesis, all G;-orbits in {1,..., p} 
are of the same order. Thus, if By,...,B, are the orbits of the action of G; 
on {1,...,p}, we get p = pve ord B, = r- ord By, which implies r = 1 or 
ord B; = 1. However, since G; # {1} for 7 < n and therefore ord B, > 1, we 
must have r = 1. As a consequence, there is only a single orbit of the action 
of G; on {1,...,p}, and hence G; acts transitively. In particular, G; contains a 
subgroup of order p for i <n, as shown before. For i = n — 1 this means that 
G,-1 is of order p, since Gp_1 ~ Gn_i/Gp is of prime order. 

A repeated application of the theorem of Lagrange 1.2/3 yields the relation 
ordG = ia ord G;/Gj41. Since p divides ordG, but p? does not, we must 
have p 4 ord G;/Gi41 for i = 0,...,2—2. Now, departing from H C Go, we get 
HC G; for i = 0,...,n —1 by an inductive argument. Indeed, if H C G; for 
some 7 <n — 2, then the canonical map 


Ha G; —_ Gi /Gist 


is trivial, since pford G;/Gj41, thus implying H C G;,,. In particular, we get 
H C G,_1 and hence H = G,_1, since G,_1 is of order p. This shows that 
HT is unique. But then H is invariant under conjugation by elements of G and 
therefore a normal subgroup in G. 


Lemma 9. In the setting of Lemma 8, let G be a solvable group and consider an 
element 0 € G. If o, as a bijective self-map on {1,...,p}, admits two different 
fixed points, then o = id. 


Proof. Following Lemma 8, there is a normal subgroup H of order p in G, and 
H is necessarily cyclic of order p, say generated by some element 7 € G C Gy. 
Factoring 7 into a product of disjoint cycles, see 5.3/1 (ii), and using ord a = p, 
it follows that 7 is already a p-cycle itself. We write 7 = (0,...,p — 1) where 
for convenience, as we will see below, we view G, as the group of permutations 
of the elements 0,...,p — 1. Now consider a permutation 0 € G admitting two 
different fixed points. By a renumbering process we may assume that one of 
them is the element 0. Therefore, let 0,7 where 0 < 72 < p, be two fixed points 
of a. Since H is normal in G, the element 


gomog' = (a(0),...,0(p—1)) 


belongs to H as well and hence equals a power a” for some 0 < r < p, say 
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(o(0),...,0(p— 1)) = (0,7-T1,...,7-(p— 1), 


where 7-7 indicates the remainder in {0,...,p — 1} on applying Euclidean 
division by p to r- 7. Now o(0) = 0 and o(i) =i show that r-7 =i, and hence 
7-i=r-t=i in Z/pZ. However, this implies F = 1 and therefore r = 1, since 
7 is a unit in Z/pZ, due to 0 < i < p. Consequently, we have o = id. 


Now let us apply the assertion of Lemma 9 to Galois groups and thereby 
derive some consequences for Galois extensions. 


Proposition 10. Let K be a field and f € K[X] an irreducible separable 
polynomial of prime degree p with splitting field L over K. Assume that the 
corresponding Galois group Gal(L/K) is solvable. Then L = K(a,{) for any 
two different zeros a, 8 of f. 


Proof. Every element o € G = Gal(L/K) induces a permutation of the zeros 
Q1,...,Q@, of f, and we may view G as a subgroup of the permutation group 
6,; see 4.3/1. Given two zeros a, € L of f, the irreducibility of f implies 
that there is an element o € G such that o(a@) = 6. Therefore, the action of 
G on {ay,...,Q@p} is transitive. In addition, G is solvable by assumption and 
hence satisfies the assumptions of Lemma 9. Thus, if a 4 6 and o € G is an 
automorphism of L leaving (a, 3) fixed, then a, as a permutation of ay,...,Qp, 
admits two different fixed points, namely a and (, and therefore must equal the 
identity. In particular, we get Gal(Z/K(a,)) = {1} and hence L = K(a, 3) 
by the fundamental theorem of Galois theory 4.1/6. 


The assertion of Proposition 10 allows the construction of a multitude of 
unsolvable finite field extensions of Q. Indeed, if f € Q[X] is an irreducible 
polynomial of prime degree p > 5 admitting at least two real zeros and one 
nonreal zero in C, then the equation f(a) = 0 cannot be solvable. Otherwise, 
we could conclude from Proposition 10 that the splitting field of f in C would 
be real, in contradiction to the fact that f admits nonreal zeros. As an example, 
consider for prime numbers p > 5 the polynomial f = X? — 4X +2 € Q[X], 
which is irreducible by Eisenstein’s criterion 2.8/1. By curve sketching one real- 
izes that f admits precisely three real zeros. Therefore, the corresponding Galois 
group cannot be solvable. Alternatively, for p = 5, we can show that the Galois 
group G of f = X° — 4X 4+ 2 is isomorphic to G5. Indeed, if we view G as a 
subgroup of Gs, see 4.3/1, then G contains an element of order 5, for example 
by Lemma 8, and hence a 5-cycle. Furthermore, complex conjugation permutes 
the two nonreal zeros of f, leaving the remaining three real zeros fixed. There- 
fore, G contains a transposition as well. But this implies G = G5; cf. Exercise 7 
in Section 5.3. Using such an argument, one can show more generally for every 
prime number p that there is an irreducible polynomial f € Q[X] of degree p 
whose corresponding Galois group is isomorphic to G,; cf. Exercise 5. 
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Exercises 


1. Let K be a field and f € K[X] a nonconstant separable polynomial. Let Ko be the 
smallest subfield of K containing all coefficients of f. Discuss the relationship 
between the solvability of the equation f(x) =0 over K and over Ko. 


2. Let K be a field and f € K[X] a separable nonconstant polynomial. Using older 
terminology, an algebraic equation f(x) = 0 is called metacyclic if it can be 
reduced to a chain of cyclic equations. This means the following: If L is a splitting 
field of f over K, then there is a chain of fields K = Ko C Ki Cc... C Kn, 
where L Cc Ky and in each case, Kj41/K; is a Galois extension given by a cyclic 
equation, hence with cyclic Galois group. Show that the equation f(x) = 0 is 
metacyclic if and only if it is solvable (resp. solvable by radicals). 


3. Determine the Galois group of the polynomial 
X7 —8x° —4xX*49x9 —4x* 496 Q/[X] 


and check whether it is solvable. 


4, Verify whether the equation 


i” 5 lWy3 1 2 
X'+4K° — FX" —-4X + 5 =0 
with coefficients in Q is solvable by radicals. 


5. Show for every prime number p > 5 that there exists an irreducible polynomial 
fp € QLX] of degree p whose corresponding Galois group (over Q) is isomorphic 
to G,. Hint: Consider a separable polynomial h, € Q[X] of degree p admitting 
exactly two nonreal zeros. Then approximate h, by a suitable irreducible poly- 
nomial f,. In doing so, use the principle of continuity of roots, i.e., that the zeros 
of hp, change in a continuous way when one makes continuous changes to the 
coefficients of hy. 


6. For a prime number p and the field F, = Z/pZ consider the group S(F,) of 
bijective self-maps F, —> F,. An element o € S(IF,) is called linear if there are 
elements a,b € F, such that o(x) = az+b for all a € F,, where necessarily a 0. 
A subgroup G C S(F,) is called linear if all elements o € G are linear. Finally, 
a subgroup G C G, is called linear if there exists a bijection {1,...,p} —> F, 
transferring G to a linear subgroup of S(F,). Show: 


(i) If o € S(F,) is linear and admits at least two different fixed points, then 
o = id. 


(ii) Every subgroup G C Gy, that is solvable and acts transitively on {1,...,p} 
is linear. 


(iii) Every linear subgroup G C Gy is solvable. 


(iv) The Galois group of an irreducible polynomial of degree p is linear if it is 
solvable. 
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Let K be a field, f € AK[X] a separable monic polynomial, and L a splitting 
field of f over kK. As we have seen, the algebraic equation f(x) = 0 is solvable 
by radicals if and only if the corresponding Galois group Gal(L/I) is solvable 
in the group-theoretic sense. This is equivalent to the existence of a normal 
series 


Gal(L/K) =G) DG, 5... G, = {1} 


with (finite) cyclic factors; cf. 5.4/7. On the other hand, starting with such a 
normal series, the fundamental theorem of Galois theory 4.1/6 shows that the 
series corresponds to a chain of field extensions 


K=Hheh cc... chet, 


where £;/E;-; for i = 1,...,r is a cyclic Galois extension with Galois group 
G;_1/G;. In such a setting, the key ingredient for solving the equation f(x) = 0 
by radicals is given by the characterization 4.8/3 (i) of cyclic extensions: Under 
the assumption that F;_, contains a root of unity of order n; = [E; : E,_1], 
where char Kk does not divide the degree n,, it follows that E; is obtained from 
Ej by adjoining an n;th root of some element c; € E;. However, let us point 
out that the existence of c; was obtained only in a nonconstructive way, using 
Hilbert’s Theorem 90. 

To arrive at a solution formula for the equation f(x) = 0 when dealing with 
a specific polynomial f, we can proceed as just explained, trying to describe 
the relevant field extensions in explicit terms. Since we are interested only in 
polynomials f of degrees 2,3, or 4, we can view the Galois group Gal(L/K) as 
a subgroup of Go, G3, resp. Gy. For these permutation groups the normal series 


62 DX = {1}, 


63 D As D {1}, 
64D XD Wz, 3D {1} 


are at hand, where the notation is as in Section 5.3. Recall that 2, is the 
alternating group of index n and YW, the Klein four-group. Furthermore, 3 is a 
cyclic subgroup of order 2 in Wy. 

Now write 71,...,%, © L for the zeros of f and view the Galois group 
Gal(L/K) as a subgroup of G,. Assume for a moment Gal(L/K) = G, and 
consider 2l,, as a subgroup of Gal(L/). Then the corresponding intermediate 
field E, of L/K can be described quite easily. One looks at the element A = 6? 


for 
6=||(#--;), 
i<j 
where A is the discriminant of the polynomial f; cf. Section 4.4. Note that 
A # 0, since f was assumed to be separable. Furthermore, A is invariant under 
all permutations 7 € G, and therefore belongs to Kk. In Section 4.4, see in 
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particular 4.4/10, we have shown in more detail how to compute A from the 
coefficients of f. In addition, if char K 4 2, the square root 6 of A is invariant 
under a permutation 7 € ©, if and only if 7 is even and thus belongs to 2,. 
This shows that K(/WA) C L* = E, and, due to VA ¢ K, even K(VA) = E,. 
Consequently, if char kK # 2, the step G, D 2, corresponds on the level of 
attached fields to the adjunction of a square root of the discriminant A. 

Proceeding as indicated in the general discussion above, we will derive now 
the special solution formulas for algebraic equations f(a) = 0 of degree < 4. 
Since it is not necessary, we will make no assumption on f to be irreducible or 
separable. However, assumptions on the characteristic of K will always ensure 
that the splitting field L of f is separable and therefore Galois over kK. We start 
with a quadratic polynomial f € KX], say 


foX’ +oX +6, 


where we assume char K # 2. We could arrive at the solutions quite easily by 
applying the method of completing the square, as usual. However, for illustra- 
tion, let us show how to argue in terms of the discriminant. Therefore, let L be a 
splitting field of f over K, and let x1, 22 be the zeros of f in L. The discriminant 
of f is determined to be A = a? — 4b. Then 6 = 2; — x is a square root of A, 
and since x; + ®2 = —a, we get 


«1 =3(-a+0), t= 3(—a— 9), 


resp. 


Lj 2 = 5 == ——b. 


These are the well-known formulas for the solutions of quadratic equations. 
Next we consider a cubic polynomial f € K[X], say 


f=X°+0X?+bX +e 


where we assume char K # 2,3. Replacing X by X — Fa, we can reduce to the 
case that f is of the somewhat simpler form 


f=X?+pX+¢. 


Again, let ZL be a splitting field of f and write 71, 22,273 for the zeros of f in 
L. The discriminant of f is A = —4p? — 27q?; see the computation following 
4.4/10. To solve the equation f(a) = 0, it is helpful to look first at the case 
Gal(L/K) = G,,, which will be referred to as the generic case. However, our 
considerations will be valid for arbitrary Galois groups Gal(L/K), as we will 
see at the end. 

Considering the normal series G3 D 23 D {1}, we start by adjoining to K 
a square root 


) => (x1 X2)(x1 x3) (X29 x3) = VA 
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of the discriminant A. Then, in the generic case that we are following for the 
moment, we recognize L/ (6d) as a cyclic Galois extension of degree 3. Moti- 
vated by the assumptions in 4.8/3 (i), we adjoin to AK’(d), resp. K, a primitive 
third root of unity ¢ and assume from now on for simplicity that ¢ € kK. Then 
L/K(6) is obtained by adjoining a third root of some element of K(6). Retrac- 
ing the constructions in 4.8/3 and 4.8/1 shows that we can choose this root as 
a Lagrange resolvent 


(¢,2) =2+Co(2) + Co(x) 


for a suitable element x € L, where o is a generating element of the cyclic group 
Gal(L/K(0)). 

Since such an element x € L cannot be specified in an obvious way, we use 
the zeros 21, 2,23 of f to define “resolvents” by 


1,2) = 71 +X + £3 = 0, 
¢,£) = 21+ (x9 + C7 a3, 
x 


1+ Cx + C23, 


where for reasons of motivation, we could imagine x, a(x), as well as o?(a) at 
the place of x1, 72, and x73. Recalling that the primitive third roots 


©) C=-p43V-3, CO =-3- V3, 


are the zeros of the cyclotomic polynomial ; = X? + X + 1, we can conclude 
that the zeros 21, %2,.x3 of f are determined by 


3((¢,2) + (C7,2)), 
(1) t= a(67(¢,2) + ¢(¢?, 2)), 
Ge) eC a). 


Thus, if we can get hold of the occurring resolvents, we will be done. Due to 
the fact that in the generic case, the extension L/K (6) admits 23 as its Galois 
group, the third powers of the resolvents (¢,a) and (¢?,x) are invariant under 
Gal(L/K(6)) and therefore are contained in K’(d). We want to justify this also 
by direct computation. Indeed, writing 


t= 


r3 = 


6 = (x1 — %2)(x1 — ©3)(22 — £3) 


2 2 2 2 2 2 
= ULQ + 19X3 + L3X1 — L{@3 — LOX, — L3%Q 
and using (0), we get, independently of the generic case, 


(C,0)° = a? +22 + a8 + 8¢ (aha + wha + 2521) 
+ 3¢7(xiar3 + xr, + x32) + 6212973 
=), 0) — 3 ig, wip t+ Samay + 3-3 + 6. 
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Here a special choice of the square root /—3 does not matter, since replacing 
V/—3 by —V—3 has the effect that the quantities ¢ and ¢?, resp. (¢,v) and 
(¢?, x), are interchanged. In particular, we can determine (¢?, x)* by replacing 
/—3 by —/—3 in the above formula. 

Now let us view (¢, x)? as a symmetric function in 21,22, 73 and write it as 
a polynomial in the elementary symmetric polynomials 


01 = $1(%1, 02,3) = 2%, + 22+ 23 =0, 
Og = 89(11, 29,03) = 21 X2 + 2123 + Lory = p, 


03 = 83(X1, XQ, 3) = 11%Q73 = —q, 


in order to get an expression in terms of the coefficients p, q of the equation 
under consideration. To do this, we follow the method of Proposition 4.3/5. 


(¢, 2)? = , 0? peer v7a; 16211203 +3V-3- 4 
= +3 ie; rir; +62122%3 
tS igg via, +ay-3 x) 
—20\02 = Se ain; — Fark, 
2112 2X3 +3/-3 4 n) 
Zo. a 71 22o3 


Therefore, we obtain (¢, x)* = of — 20102 + 203 + 3./—3- 6, and hence due to 
o, = 0 and 03 = —q, 


(2) (Gay eget gy 3b = — fa Foy Ge gy, 


as well as 


(3) Cha) = Ba V3 8 = — Fa — 27) + 


The resolvents (¢,«) and (¢?,x) are uniquely determined by these equations, 
up to a third root of unity. Furthermore, the equations (1) for 21, 22,273 show 
that we may replace (¢,x) by ¢(¢,x) if at the same time we replace (¢?, x) 
by ¢7(¢?, x). This suggests that the third roots of the right-hand parts of the 
equations (2) and (3) cannot be chosen independently of each other in solving 
the equation x?+pxr+q = 0 by means of the relations (1). Indeed, the calculation 


(¢,a)(C?, x) = (@ + Cte + Ca3)(ay + Cay + Cag) 


= ap +03 + 23+ (C+ 07) (aire + 2123 + 223) 
= g? — 302 = —30) = —3p 


confirms this matter. Therefore, we can state the following: 


268 6. Applications of Galois Theory 


Proposition 1 (Cardano’s formulas). Let K be a field satisfying char K 4 2,3. 
For coefficients p,q € K, the solutions of the algebraic equation x? +px+q=0 
are given by 


w=utv, wt =Cut cv, x3 = Cut cn, 


where C € K is an arbitrary primitive third root of unity, and where 


2+) +(2) -2-¥G) +) 
" Gye): 2 3) t\9)> 


with the side condition that the third roots are chosen to satisfy wv = —}p. 


Proof. If we replace in the above expressions u,v by Cu, ¢?v, resp. C7u, Cv, this 
only produces a permutation of x1, %2,73. Therefore, we may assume without 
loss of generality that u = 4(¢,x), as well as v = 3(¢?, x). Then, due to the 
equations (1), the quantities 71, 22,x3 mentioned in the assertion will coincide 
with the solutions of the equation x° + px + q = 0. 


Finally, let us consider a quartic polynomial f € K[X], say 
f =X* + px? + 9X +7, 


where we assume char K 4 2,3 again. Every monic polynomial of degree 4 can 
be transformed to this type by applying a substitution X -—> X — FC, for a 
suitable constant c € K. Furthermore, let 21, x%2,73,x4 be the zeros of f ina 
splitting field Z of f over K. Since the term of degree 3 in f is trivial, we get 
Ly + 22+ 43 +24 = 0. Similarly as we did for polynomials of degree 3, we start 
our consideration by looking at the generic case in which Gal(L/K’) = Gy. In 
addition, we assume in this case that 21, 22,23 are algebraically independent 
over the prime field of AK. For example, such a setting is obtained in trans- 
forming the generic equation of degree 4 to the special type we are considering 
here. However, it should be observed that the computations below are valid 
independent of such special assumptions. 
In the generic case we look at the normal series 


64D XD W,d3> {1} 


already mentioned before, as well as at the corresponding chain of field exten- 
sions 


KCIMCL™® CICL. 


As usual, we get L** = K(6) for a square root 6 of the discriminant A of f, 
where one can show that 


A = 144pq?r — 128p*r? — 4p3q? + 16p*r — 27q* + 256r° 
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using 4.4/10. However, we will not need this in the sequel. The extension 
L™+/L*4 is of degree 3 and hence is generated by an arbitrary element of L™* 
that is not contained in L*™, for example by 


2 = (a1 + 2)(a3 +24) € L. 


Indeed, 2; is not left fixed by the permutation (1, 2,3) € 24. On the other hand, 
z, is invariant under all elements in W,, and in addition, under the permutations 
(1,2), (3,4), (1,3, 2,4), and (1,4, 2,3). Together, these are the members of the 
stabilizer subgroup of z; in G4. Using the orbit-stabilizer lemma 5.1/6, we can 
conclude that the orbit of z; under the action of G4 consists of precisely three 
elements, namely 


A= (x1 + x2) (x3 + £4), 
2 = (41 + 3) (a2 + £4), 


23 = (41 + 4) (22 + 23). 


But then 21, zo, z3 are the solutions of an equation of degree 3 with coefficients 
in K, namely of 


2 — bz? + bez — bs = 0, 


where 1, bo, b3 are the elementary symmetric polynomials in 21, 22, 23; in more 
detail,! 


by = 2 + 22 + 23 = 2 ics TEs, 
by = 2122 + 2123 + 223 = Vie; apay + 3D i, Casey + 621227304, 
bz = 212223 = ik a} UFLp +2 ae it EUGURU 
+25S> otek 445° UE LG TRL 
i<j<k s kel. 


As we see, b;, b2, bg are symmetric in 21, 22,73, 74. Therefore, we can write the 
b; as polynomials in the elementary symmetric polynomials 


Cy = 81 i; W538) = DG = 0, 
02 = $9(%1, £2, 03,24) = Dic; Vik; =p, 
3 = $3(X1, ©2,%3,X4) = Vc jcp ViXjLk = —4, 
O4 = S4(X1, Lo, ©3, L4) = 11T2T3X4 aig 


and hence express the }; in terms of the coefficients p,q,r of the algebraic 
equation we started with. First, we read 6; = 209 = 2p. Next, to describe bo, 
we follow the method from the proof of Proposition 4.3/5: 


| In subsequent summations, the indices vary over {1,2,3,4}. Different indices of a sum 
are allowed to assume only different values. 
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= ies % vj 
a = Vice ae a 2D ut 52 jLp +621 Lo03X4 


paA tas Lk 
J<k 


+3 y, i Tit ite +627 T9304 


0103 = s, T7204 L_ +40 102X304 
—42 1 X9X3X4 

—4o4 = —42 129X324 
0 


This yields by = 0? + 0103 — 404 = p? — 4r, since a, = 0. Finally, we express b3 
in terms of 01, 02, 03, 04: 


bg = Dies Ld es +2 days it D0, oy +2 Vasile ©7050; +4 Lisi x? PULL] 

010203 ay 7 tet +3 sO xj, 2p21 +3 Vaiek ao, +8 Lisi Eat 

7 a a UPEjTRE. — Dicjce Bit; TE —4 diss UF LZLKL 

—0704= =e Ute Dy apt ei 
j<k<l kel 

= licick UF x; Lp —2 Lisi a} UF TEE 

-= ee en rE —2 visi U2 eee 

0 

This shows that b3 = 010203 — 0704 — 0? = —q”, again since a, = 0. Thereby, 


independently of the generic case, we recognize 21, 22, 23 as the solutions of the 
equation 
2° — Qp2” + (p? —4r)z+q7 =0. 


This equation (or its inherent polynomial) is sometimes referred to as the resol- 
vent cubic of the equation «++ px?+qxr+r = 0. Note that the solutions 21, 22, 23 
of the resolvent cubic can be determined by means of Cardano’s formulas. 

Looking at the generic case again, we see that Wy, is characterized as the 
subgroup of Gy fixing 21, 22, 23. This implies Gal(Z/K (21, ze, 23)) = Wa, as well 
as K (2, 22,23) = L™. Thus, to pass from K(z1, 22,23) to L we would have to 
adjoin square roots according to the chain UW, D 3 D {1}. Now observe that 
x, + £ is invariant under the permutations (1) and (1, 2)(3,4) of Wy, but not 
with respect to the remaining elements of %J,. Therefore, 7; + x2 is of degree 2 
over K(21, 22, 23). Indeed, independently of this, the equations 


(x4 + X2) (x3 + L4) = 2%, t+2@+23+2%4=0 


yield 
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%+%2 = V-AZ1, t3+%4=—-V—Z1, 


for a suitable square root of —z,. Likewise, we have 


v1 1 3 = V— 22, tg 1 t4 = ~V—2%2, 


Ly +l, = V—23, Lg + ©3 = —V—23, 


and as a consequence, 


x, = 3( V—21 + V—z2 + V—23), 
ta = 35( V—21— V—22— V—2a), 
tg = 3(-V—-a1 + V—2 — V—23), 
r4 = 3( V—4 — V—% + V—23). 


Similarly as we have done for cubic equations, let us examine how to choose the 
square roots of —2,, —22, —z3. Observe for this that 


(x1 + £2)(x1 + ©3)(1 + ©) = 27(1 +o +23 +24) + Vicjck VIP ik 


= ae LLL 


2 


It follows that the square roots \/— 21, /—22, /—23 must satisfy the side condi- 


tion 
V—-41'°V 22° VV 23 = — Ed. 


Therefore, we can state the following result: 


Proposition 2. Let Kk be a field of characteristic char K 4 2,3. For coefficients 
p.q,r € K, the solutions of the algebraic equation x4 + px? + qx +r = 0 are 
given by 


r = 3( /—21 ++/—m +-/=23), 
Ly = 5( V/—%1 — V—22 — V—2a), 
a3 = 3(-V—21 + V—22 — V—23), 
tq = 3(-V—41 — V—22 + V—2), 


where 21, 22, 23 are the solutions of the resolvent cubic 
2" — Qpz” + (p*? — 4r)z + q? =0, 
and where the square roots are required to satisfy the side condition 


V=21 + V—22: /—23 = —4. 


Finally, let us add that due to the relations 
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2 — 22> (x4 £4) (a2 3), 
41 — 23 = (x1 ©3)(X2 La), 
22 23> (x1 X2) (x3 £4), 


the discriminant of X*4 + pX?+qX +r coincides with the discriminant of the 
resolvent cubic X3 — 2pX? + (p* — 4r)X + q’. 


Exercises 


1. Let K be a subfield of R, the field of real numbers. Furthermore, let f,g © K[X] 
be monic irreducible polynomials of degree 4, resp. 3, such that g(z) = 0 is the 
resolvent cubic of the algebraic equation f(x) = 0 (assuming that the term of 
degree 3 in f is trivial). Determine the Galois group of the equation f(x) = 0 
under the assumption that f does not admit zeros in R. 


2. Let K bea field and L a splitting field of a quartic polynomial f € K[X], where 
we want to assume that the term of degree 3 in f is trivial. Furthermore, let L’ 
for K Cc L’' C L bea splitting field of the resolvent cubic of f. View the Galois 
group G = Gal(L/K) as a subgroup of Gy and show that GM Wy is a normal 
subgroup in G such that Gal(L//K) = G/(GN Wu). 


6.3 Fundamental Theorem of Algebra 


In the past, various attempts at studying the algebraic structure of classical 
fields like R and C have given stimulus to further develop the general theory 
of fields and their extensions. The fundamental theorem of algebra is a promi- 
nent example of this. Its proof goes back to Euler and Lagrange and will be 
given below in terms of methods from algebra. But there exist other proofs, for 
example using techniques from complex analysis. 


Theorem 1. The field C of complex numbers is algebraically closed. 


Proof. Recall that C is constructed from the real numbers R by adjoining a 
square root of —1. To show that C is algebraically closed we must rely on 
certain properties of R. In fact, we will use the following properties: 

Every polynomial f € R[X] of odd degree admits a zero in R. 

Every element a € R, a > 0, admits a square root in R. 
The second property implies, as we want to show, that every quadratic poly- 
nomial in C[X] admits a zero in C. To prove this it is enough to verify that 


every element z € C admits a square root in C. Therefore, consider an element 
z=xu+iy€C, where x,y € R. To write z as a square in C, ie., 


z=a2+iy = (a4+ ib)? = a? — B+ Qiad 


for elements a,b € R, we have to solve the equations 
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r=a'—0*, y = 2ab, 


in the unknown quantities a and b. A straightforward computation shows that 
up to the sign of a and b, these equations are equivalent to 


1 1 il 1 
a’ = sat aV@ ty, =—set gV@ ty. 


The occurring square roots are meant to be nonnegative, and the + sign is, 
in fact, a + sign, since negative values for a? and b? are excluded. Therefore, 
if we use the fact that nonnegative real numbers admit square roots in R, the 
existence of the desired solutions a and 0b is assured. 

Now consider a chain of field extensions RC C Cc L, where L/C is finite. 
To prove that C is algebraically closed, we have to show that L = C. Enlarging 
L if necessary, we may assume without loss of generality that L/R is a Galois 
extension. Let G = Gal(Z/R) be the corresponding Galois group and assume 


[L: R] =ordG = 2*m, where 2{m. 


Then we see that k > 1. Furthermore, it follows from 5.2/6 that G contains a 
2-Sylow group H, hence a subgroup of order 2*. Therefore, the fixed field L” 
under the action of H on L satisfies 


[L : | a2 [L* ; R] =m, 


due to the fundamental theorem of Galois theory 4.1/6. Since every real polyno- 
mial of odd degree admits a zero in R, as was mentioned before, we can conclude 
that m = 1, for example by applying the primitive element theorem 3.6/12. But 
then L is of degree 2* over R and hence of degree 2*~! over C. Furthermore, 
L/C is a Galois extension. Now, if L # C and therefore k > 2, we can use 5.2/4 
to see that there exists a subgroup H’ C G’ = Gal(L/C) of order 2*~?. The cor- 
responding fixed field L”’ satisfies [L : L?’] = 2*-? and hence [L”” : C] = 2. 
However, this is impossible, since every quadratic complex polynomial admits 
a zero in C, as we have shown. Consequently, we must have L = C and C is 
algebraically closed. 


The proof of the fundamental theorem of algebra given above is based on 
the theory of Sylow groups. In Exercise 2 below we suggest a different proof 
avoiding this theory. Furthermore, let us add that from a purely algebraic point 
of view, the field R of real numbers is not unique as a subfield of C, since 
there exist automorphisms of C that do not map R to itself; see, for example, 
Exercise 2 of Section 7.1. However, there are deeper reasons for the fact that 
C, as an algebraic closure of R, is of degree 2 over R, as shown by the following 
result going back to E. Artin. 


Proposition 2. Let K be a field, K an algebraic closure of K, andi € K an 
element satisfying i? = —1. Then K = K(i) if [K : K] < oo. If, in addition, 
K/K is a nontrivial extension, we have char K = 0. 
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Proof. Assume that the degree [K : K] is finite. Since K is algebraically closed, 
the extension K/K is normal, and we claim that K’/K is even Galois. Indeed, 
assuming char K = p > 0, there is an intermediate field L of K/K, due to 3.7/4, 
such that K/L is purely inseparable and L/K is separable. Then consider the 
Frobenius homomorphism ao: K —> K, x ++ x”, and observe that o is an 
automorphism, since K is algebraically closed. Since o(L) C L and the degree 


[K +L] =|o(kK):0(L)| = [K+ o(f)| 


is finite, we get o(L) = L. However, this shows that L does not admit a non- 
trivial purely inseparable extension. Therefore, we have L = K, and K/K is 
Galois. 

It follows that K/K(i) is a finite Galois extension as well, and we have to 
show that this extension is trivial. Suppose that such is not the case. Then there 
is a subgroup in Gal(//K(i)) whose order is prime; for example, use 5.2/8 or 
5.2/11. If L C K is the corresponding fixed field, then the degree [K : L] is 
prime as well, say [kK : L] = ¢, and K/L is a cyclic Galois extension of degree 0. 
Assuming p = char K > 0, let us first consider the case ¢ = p. Then K = L(a) 
by 4.8/5 (i) for some element a € K whose minimal polynomial over L is of 
type X? — X —c. To obtain a contradiction, consider the map tT: K —> K, 
xt x? —x. It is surjective, since K is algebraically closed. Furthermore, using 
the relation trz,, (2?) = (tryz,,(x))? derived from 4.7/4, we conclude that 


trz/, OT = TIL 0 tre yp: 


Since both 7 and trz,;, are surjective, see 4.7/7, it follows that T|z is surjective 
as well. However, then X”? — X — c admits a zero in L, in contradiction to the 
fact that this polynomial equals the minimal polynomial of a over L. 

Next assume that the prime degree ( = [K : L] is different from the 
characteristic of K, and choose a primitive (th root of unity Cp € K. Since it is 
of degree < ¢ over L by 4.5/7, the multiplicativity formula 3.2/2 shows that we 
must have ¢, € L. In particular, the result 4.8/3 (i) becomes applicable, and we 
conclude that K = L(a) for some element a € K whose minimal polynomial over 
L is of type X‘—c. Now let a € K be an ¢th root of a, i-e., an element satisfying 


a* =a. Then the multiplicativity of the norm of K over L in conjunction with 


4.7/2 (ii) yields 


Ngxjzp(a)® = Nxjz(a") — Nx/1(@) = ( De 


Therefore, if ¢ is odd, Ngz,,(a) € L is an (th root of c, contradicting the fact 
that the polynomial X‘—c € L[X] is irreducible. Finally, if 0 = 2, we see that 
Ng /1(@) € L is a square root of —c. But then, since 7 € L, also c admits a 
square root in L. Again, this contradicts the irreducibility of the polynomial 
X?—c€ L[X], thus establishing the desired equality K = K(i). 

Now assume K © K(i) = K, so that in particular, —1 is not a square in 
kK. To prove char K = 0, we show that the sum of two squares in K is again 
a square in Kk. Indeed, consider two elements a,b € K. Then a+ ib admits a 
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square root in K(i), say «+iy, and we have x? — y? +2ixy = a+ib. This implies 


a= x? — y*, b = 2xy, and therefore 


ae 4 2 = (x? =. y’y? te Ag*y? = (x? Ae y’). 


Furthermore, we can use induction to show that the sum of finitely many squares 
in K is again a square in Kk’. But then, given any field of positive characteristic, 
the element —1 can be written as a repeated sum of the unit element 1 = 1? and 
hence is a square. However, since —1 is not a square in kK by our assumption, 
the case of positive characteristic is excluded, and we have char K = 0. 


Exercises 


1. Specify the arguments needed to establish the basic properties of real numbers that 
were used in the proof of Theorem 1, namely that every real polynomial of odd 
degree admits a zero in R and that every element a € R, a > 0, admits a square 
root in R. 


2. Let f € R[X] bea nonconstant polynomial of degree n = 2*m, where 2{m. Show 
by induction on k that f admits a zero in C, and deduce the fundamental theorem 
of algebra from this fact. Hint: Assume f to be monic and factorize it over an 
algebraic closure R of R into linear factors, say f = [[?_,(X — a). Then set 
Aw = Uy + ay + ba,a, for arbitrary b € R and apply the induction hypothesis 
to the polynomial g = [| X — Q,,). The properties of R, as specified in 
Exercise 1, may be used. 


ae 


3. For a field K, consider a polynomial X" — c € K[X] of degree n > 2, where 
c # 0. Generalizing the methods used in the proof of Proposition 2, show that 
X” — ¢ is irreducible if and only if c is not a pth power in K for any prime p 
dividing n and if, in addition, for 4|n, the element c is not of type c = —4a* for 
some a € K. Hint: Start with the case that n is a prime power. 


6.4 Compass and Straightedge Construction 


In the present section we apply Galois theory to the study of geometric construc- 
tion problems that are given in the complex plane C. We fix a subset M Cc C 
(later it will be 17 = {0,1}) and say that a point z € C can be obtained via 
compass and straightedge constructions from M if by means of finitely many 
basic construction steps, M can be enlarged to a subset M’ C C such that 
z € M’. The following basic construction steps are allowed: 


(1) Consider two nonparallel straight lines g; and gg in C, determined by 
points 21,22 € M, resp. z3,z, € M, and add to M the intersection point of g; 
and gp. 


(2) Consider a circle K in C with center a point z; € M and with radius 
given by the distance |z3 — zg| between two points 29,23; € M, as well as a 
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straight line g determined by two points z4, z5 € M, and add all intersection 
points of K and g to M. 

(3) Consider two distinct circles Ky and Kk, in C with centers 2, z2 € M, 
and with radii |z4 — 23], resp. |zs — 25|, given by the distances between points 
23,24 © M, resp. 25, 26 € M. Add to M the intersection points of Ky, and Ko. 


We write R(/) for the set of all points in C that can be obtained by compass 
and straightedge constructions from M, where we will always assume 0,1 € M. 
Furthermore, let MW be the image of M with respect to the complex conjugation 
C — C, z +> z? Then clearly we get &(M) = &(M UM), since we can 
use compass and straightedge constructions to mirror any point z € M —R on 
the real axis R, thereby obtaining the complex conjugate Z of z. Indeed, if z is 
purely imaginary, i.e., with real part Re z = 0, we obtain Z as an intersection 
point of the line through 0 and z with the circle having center 0 and radius 
|z| = |z — 0|. Otherwise, look at the circle with center z and with radius |z]. It 
has two intersection points 2), 22 with the line through 0 and 1, and we obtain 
Z as an intersection point of the circles with centers z, and z2, in both cases 
with radius |z]. 

To characterize the set A(M) in terms of algebraic field extensions, we start 
out from the smallest subfield in C containing M and M, which equals the field 
Q(M U M), obtained from Q by adjoining all elements of MU M. 


Proposition 1. Let M C C satisfy 0,1 © M. The following conditions are 
equivalent for any point z € C: 

(i) z € R(M). 

(ii) There exists a chain of field extensions 


QMUM)=IpC hy c...ci,cect 


such that z € Ly and [L;: Li-1] =2 fori=1,...,n. 
(iii) z is contained in a Galois extension L of Q(M UM) whose degree 
[L:Q(MU™M)] is a power of 2. 


As a direct consequence, this implies the following: 


Corollary 2. Let M C C satisfy 0,1 € M. Then R(M) is an algebraic extension 
field of QM UM). The degree over Q(M UM) of any element z € &(M) is a 
power of 2. 


Proof of Proposition 1. We start with the implication (i) => (ii) and show first 
of all that M can be assumed to be a field that satisfies IM = M, as well as 
i € M for the complex number i € C, square root of —1. Indeed, the complex 
conjugation on C restricts to an automorphism on the field Lp = Q(M UM), 


? In the present section, we use the notation M for the image of M with respect to the 
complex conjugation map, even if M is a field; an algebraic closure of such a field M Cc C, 
usually denoted by M, is not needed. 
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so that Lo = Lp. Then L, = Lo(i) is an extension field of Lo of degree < 2 
satisfying L, = L, and i € Ly. Furthermore, we have &(M) C &(L;) as well as 
Q(L; UL;) = Ly. Hence, it is enough to verify the implication from (i) to (ii) 
for L, instead of M. In other words, we may assume M to be a field satisfying 
M=Mandice M. 

From this assumption we conclude for any point z € M that its real part 
Rez = $(z + 2), its imaginary part Im z = 3(z — 2), as well as the square 
|z|? = 2Z of its absolute value all belong to M. Now consider a point z € &(M). 
It is enough to look at the case in which z is obtained from M by a single basic 
construction step and to show that z is contained in M or in an extension field 
L = M(VA) obtained from M by adjoining a square root of a nonnegative real 
number A € MMR. Then, due to the fact that L = L and i € L, the general 
case follows using an inductive argument. 

We start by assuming that z is obtained from M by means of a basic con- 
struction step of type (1). Then z equals the intersection point of two lines 


n= {24 +t(z—-m);teE R}, 
on = (25+ tea); # ER}, 


where 21, 22, 23, 24 € M, and we have to solve the equation 
att(ze — 2%) = 23 4+ t' (24 — 23) 


for the parameters t,t’ € R. Splitting the equation into its real and imaginary 
parts yields a system of two linear equations with coefficients in RM M for the 
unknown quantities t, t’, which admits a unique solution (to, tj) € R?. Since this 
solution can be calculated from the coefficients of the equations by means of 
rational operations, for example using Cramer’s rule, we get to, tj € RN M and 
therefore 

z= 2 + to(ze— 21) = 23 + to(24 — 23) € M, 


so that in this case it is not necessary to extend M. 
Next we assume that z is obtained from M by applying a basic construction 
step of type (2). Hence, z is an intersection point of a circle 


K={CeC; |¢— a)? = zs — 20/?} 
and a straight line 
g = {24 + t(25 — 24); t€ R}, 


where 2,...,25 € M. To determine the intersection points of K and g, the 
equation 


|z4 + t(25 — 24) zal? = |2¢ al? 


has to be solved for t. This is a quadratic equation for t with coefficients that 
are obtained by means of rational operations from the real and imaginary parts 
of z1,...,25 and hence belong to RM M. As a consequence, the equation can 
be written as t? + at + b = 0 for suitable coefficients a,b € MMR, and we see 
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that the intersection point z € K Mg under consideration corresponds to a real 
solution to of t? ++at+6 = 0. Then the discriminant A = a? — 4b of the equation 
satisfies A > 0, and we get to € (MMR)(VA), as well as z = 24+to(zs—z4) € L, 
where L = M(vVA). 

It remains to consider a basic construction step of type (3). Therefore, let 
z be an intersection point of two distinct circles 


- —_—. 
Ky={C eC; |¢-al?=rij}, 
2_ 22 
Ko ={CeC; |¢- a)? =r3}, 
where ry = |24—23|, T2 = |26—25|, 21,---, 26 € M. Then z satisfies the equations 
22 — 22, —-74 +221 = r?, 
Zz — 229 — 22g + Zoz9 = Te, 


and taking their difference, an equation of type 


az+@z+b=0, ie, 2Re(az)+b=0, 

where a = 2% — 2, € M and b € MOR. Since Ky and Ko have different 
centers, we must have a # 0. Hence, viewing z as a parameter, the latter 
equation describes a straight line g, which in particular, contains the points 
=, — € M. Furthermore, the intersection points of g with AK, and Ky coincide 
with those of AK, and Ky. Therefore, we can proceed in the same way as we did 
for basic construction steps of type (2). This ends the proof of the implication 
(i) ==> (ii). 

To prove the reverse implication (ii) ==> (i), it is enough to show that RUM) 
(assuming 0,1 € M) is a subfield of C containing MUM and hence Q(MUM), 
and that for every element z € (/), its square roots +./z belong to &(M). 
To verify these facts, we refer to the relation A(M) = &(M UM) and establish 
the following assertions, where some of them are needed only for proof-theoretic 
reasons: 


a) 21,22 € R(M) => 241+m€ R(M), 
(b) z € R(M) 2 € R(M), 

) 2€ R(M) = |2| € A(M), 

) 


(d) e™2 =54 5-iv3€ R(M), 
e) 21,22 € RIM) => |z1||z2| € RLV), 
(f) z € &(M),z 40 => |2|“1 € R(M), 
g) 21, 22 © R(M) = 22 € R(M), 
(h) z€ A(M),z 40 = 21 € AM), 
(j) 2 € R(M) t/z € A(M). 
Each of the preceding assertions can be verified by a simple geometric con- 
struction. Concerning (a), interpret the addition of complex numbers as vector 
addition. The “vector” z + z2 corresponds to the diagonal of the parallelogram 
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spanned by the “vectors” 21,22. For (b) apply to z a point reflection at the 
origin. Indeed, —z is situated on the line through 0 and z (assuming z # 0), 
as well as on the circle with center 0 and radius |z| = |z — 0]. Similarly, in (c) 
we can interpret |z| as an intersection point of the real axis and the circle with 
center 0 and radius |z|. Next, assertion (d) is needed for the verification of (e) 
and (f), where we have to know that &(/) contains a nonreal point apart form 
the points 0,1. To establish (d) construct an equilateral triangle over the line 
segment from 0 to 1. Its top vertex, intersection of the circles about 0 and 1 with 
radius 1, is then the primitive sixth root of unity e™/3 = 4+ 4-iV3. Turning 
to the implications (e) and (f), assume z; 4 0 ¥ z. and consider the following 
figure: 


To draw it, choose a point 29 = |Zo|e’* in M—R such that Rez > 0, for example 
zy = e™/%, and consider the line g through 0 and z. Then, on g we can look 
at the points p = e’* and q = |z2\e’*, as well as on the real axis at the point 
a = |z|. All these points belong to &(M), as is easily verified. Furthermore, on 
the real axis consider the point c that is obtained as the intersection point of R 
and the parallel of ga, through q, where gq,» is the line determined by a and p. 
Also c belongs to R(M), as is easily verified using elementary constructions. For 
example, drop a perpendicular from gq to the line g,,,, and on this perpendicular, 
draw another perpendicular at g. Then we can read from the intercept theorem 
that 


lal - Welt = lel Jal, 


whence, due to |q¢| = |z2|, |p] = 1, and |a| = |z|, that 


Cc 


= lal lal = laa] + |2al 


and therefore |z| - |z2| € RUM). Furthermore, constructing the parallel of gap 
through 1 € R, its intersection with g yields a complex number of absolute 
value |2;|~', where |z;|~' € R(M) by (c). To settle the implications (g) and (h), 
observe that in calculating the product of two complex numbers, their absolute 
values are multiplied and their arguments added. Therefore, it remains to show 
that addition and negation of angles is possible in terms of basic construction 
steps; however, this does not pose any problem. Since the same is true for the 
bisection of angles, it is enough to show for the implication (j) that given a 
point z € &(M) — {0}, the square root \/|z] can be obtained through basic 
construction steps. To achieve this, consider on the real axis the line segment 
from —|z| to 1 and construct Thales’s semicircle over it. Then, by the altitude 
theorem for right triangles, the intersection with the perpendicular erected at 
0 yields a complex number of absolute value J/l2l . Thus, we have shown that 
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&(M) is a subfield of C that is closed under the extraction of square roots. This 
ends the proof of the equivalence of conditions (i) and (ii). 

It remains to show that conditions (ii) and (iii) are equivalent. Assuming 
condition (ii), we consider in C the normal closure L of L, over K = Q(MUM); 
cf. 3.5/7. If o1,...,0, are the distinct K-homomorphisms of L, to C, we can 
interpret L as the field that is generated over K by all images o;(L,,), where 
i =1,...,r. Since L, is obtained from K by successive adjunction of square 
roots, the same is true for every o;(L,) and hence also for L. Therefore, L/K 
is a Galois extension whose degree is a power of 2. Since we have z € L, C L, 
condition (iii) follows. 

Conversely, assume that condition (iii) is given. Then the Galois group 
Gal(Z/F‘) is a 2-group and therefore solvable by 5.4/6. It follows that Gal(L/K) 
admits a normal series with cyclic factors of order 2; see 5.4/7. But then, by 
the fundamental theorem of Galois theory 4.1/6, this corresponds to a chain of 
field extensions as required in condition (ii). 


The assertion of Proposition 1 can be used on various occasions to show that 
certain points of the complex plane, or distances between such points, cannot 
be obtained from a given subset MM Cc C by means of compass and straightedge 
constructions. A famous example of this kind is the problem of squaring the 
circle. Its challenge is to use compass and straightedge constructions to turn a 
circle, given by its center and its radius, into a square whose area is equal to that 
of the circle. For instance, look at a circle of radius 1 and with center 0. Its area 
equals the number 7, so that a square of the same area will have edge length \/7. 
Therefore, the problem of squaring the circle consists in figuring out whether 
7m belongs to R({0, 1}). However, &({0,1}) is an algebraic extension of Q, as 
we know from Corollary 2. On the other hand, F. Lindemann proved in 1882, see 
[13], that 7, and hence \/7, is transcendental over Q. Consequently, \/7 cannot 
be obtained from {0,1} by means of compass and straightedge constructions, 
and it follows that squaring the circle in such a way is impossible. In the past, 
compass and straightedge constructions have been used on several occasions 
to arrive at excellent approximate solutions for 7 and ,/7. Sometimes such 
solutions were interpreted as exact solutions, which, however, is not permissible. 

Another classical problem that does not admit a solution is the problem of 
doubling the cube: Is it possible to double the volume of a cube by means of 
compass and straightedge constructions? For example, starting with a cube of 
edge length 1, doubling its volume leads to a cube of edge length 7/2. However, 
as we know from Corollary 2, the cube root W2 does not belong to &({0, 1}), 
since its degree over Q is not a power of 2. In a similar way, one can treat the 
problem of angle trisection; cf. Exercise 2 below. 

Finally, let us study applications of compass and straightedge construc- 
tions to regular polygons.? Important contributions to the subject are due to 
C. F. Gauss. The problem consists in finding out whether for a given integer 
n > 3 the regular n-gon is constructible, in the sense that the primitive nth 


3 Regular polygons will always be assumed to be convez. 
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root of unity e?"/” belongs to A({0, 1}). Note that here and in the following, the 
term “constructible” alludes always to compass and straightedge constructions. 
In the proof of Proposition 1 we have already shown that e™/? € A({0,1}). 
Therefore, the regular 6-gon is constructible. More generally, we prove the fol- 
lowing: 


Proposition 3. For an integer n > 3, the regular n-gon is constructible if and 
only if p(n) is a power of 2, where vy is Euler’s p-function (cf. 4.5/3). 


Proof. Let ¢, be a primitive nth root of unity over Q. Then we know from 4.5/8 
that Q(¢,)/Q is an abelian Galois extension of degree y(n). Assuming that the 
regular n-gon is constructible, ie., that ¢, € A({0,1}), and using Corollary 2, 
the degree of ¢, we Q, which equals y(n), is a power of 2. Conversely, if 
it is known that y(n) = [Q(¢n) : Q] is a power of 2, we can conclude that 
Gn € K({0, 1}) from . iciplicaiioh (iii) => (i) in Proposition 1. 


Using 4.5/4 (iii), it is easy to determine the values of the y-function: 


Een Senn Enea 


A number in italics for y(n) in the second row indicates that the regular n-gon is 
not constructible. In particular, the regular 7-gon is the first polygon in this list 
that is not constructible; the proof of this fact goes back to Gauss. In addition, 
Gauss was the first to give an explicit description of the (rather complicated) 
construction of the regular 17-gon; note that y(17) = 16 is a power of 2. 
Furthermore, let us mention the relationship between the constructibility of 
the regular n-gon and the decomposition of n into so-called Fermat primes. 


Definition 4. For ¢ € N, the integer Fy = 22° + 1 is called the (th Fermat 
number. A Fermat prime is a Fermat number that is prime, hence a prime 
number of type 22° +1. 


The Fermat numbers Fp = 3, F, = 5, Fh = 17, F3 = 257, Fy = 65537 are 
prime and hence Fermat primes. So far, these are the only Fermat numbers that 
are known to be prime. 


Proposition 5. The following conditions are equivalent for n > 2: 

(i) y(n) is a power of 2. 

(ii) There exist distinct Fermat primes pi,...,ppr and an integer m € N such 
that n = 2" py... Dr. 


Proof. Given a prime p, the expression y(p™) = (p — 1)p”~' is a power of 2 if 
and only if p = 2 or if p™' = 1,i.e.,m =1, and p—1 is a power of 2. Therefore, 
using the multiplicativity of the y-function, the assertion of the proposition can 
be derived from the following lemma: 
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Lemma 6. A prime number p> 3 is a Fermat prime if and only if p—1 is a 
power of 2. 


Proof. From the definition of Fermat numbers p we see that p—1 is a power of 2. 
Conversely, assume for a prime p that p—1 is a power of 2, say p = (22")\r +1 for 
an odd exponent r. Then, if r > 1, we can factorize p according to the formula 


1+a” =1-(-a)” = (1—(-a)) ((—a)""! + (-a)"? +...+1) 
into a product of nontrivial factors 


Oy Bie OF OP yd), 


However, since p is prime, we must have r = 1. 


To sum up, we can state that the regular n-gon is constructible if and only 
if n is a product of type n = 2™p,...p, for distinct Fermat primes pj,...,), 
and a natural number m. 


Exercises 


1. Consider a subset M C C such that 0,1 € M. Discuss the question whether an 
element z € C is contained in R(M) if its degree over Q(M UM) is a power of 
2. In particular, assume M = {0,1} and look at the case that z is of degree 4 
over Q. 


2. Look at the problem of angle trisection and check whether it admits a solution in 
terms of compass and straightedge constructions. 
3. Consider the extension &(1Z)/Q for M = {0,1} and show: 
(i) &UM)/Q is an infinite Galois extension. 
(ii) &(M) can be interpreted as the union of an ascending chain of Galois ex- 
tensions of Q whose degree in each case is a power of 2. 


(iii) Give a characterization of the group Gal(.(M/)/Q) using the notion of pro- 
jective limit; see Section 4.2. 


4, Exhibit an explicit compass and straightedge construction for the regular 5-gon. 


7. Transcendental Field Extensions ®) 


Check for 
updates 


Background and Overview 


Looking at the domain of rational numbers Q, it was realized quite early that 
certain familiar “numbers,” such as V2, are not rational and are hence irra- 
tional, as one began to say. There were several attempts to classify irrational 
numbers. For example, Galois theory provided a first means to access at least 
the algebraic ones among the irrational numbers, hence those satisfying a non- 
trivial algebraic equation with coefficients in Q. Shortly thereafter it was dis- 
covered that algebraic irrational numbers make up only a “very small” part of 
the realm of all irrational numbers, i.e., that most of them will not satisfy a 
nontrivial algebraic equation with coefficients in Q. Such numbers were called 
transcendental. 

For a transcendental number over Q, such as 7, the simple field exten- 
sion Q(7)/Q is easy to describe. Indeed, the monomorphism Q[X] @ Q(z), 
X ++ 7, gives rise to an isomorphism Q(X) + Q(z), where Q(X) is the 
function field in a variable X over Q, thus the field of fractions of Q[X]. But 
how can we describe the structure of an extension L/Q from an algebraic point 
of view for more complicated subfields L C C, or even for L = C? An amazingly 
simple answer to this question was given by E. Steinitz in his groundbreaking 
work [15]. Indeed, for an arbitrary field extension L/K’, there exists a system 
t = (x;)ier of elements in L such that r admits the properties of a system of 
variables over K and such that L is algebraic over the “function field” K(r). 
The system r is referred to as a transcendence basis of L/K. However, it has 
to be observed that the intermediate field A(x) of L/K depends on the choice 
of x. Steinitz showed that transcendence bases somehow behave like bases of 
vector spaces and in particular, that every two transcendence bases of a field 
extension L/K are of same cardinality. We will explain all this in more detail 
in Section 7.1. 

The study of field extensions L/K without an algebraicity assumption is 
of interest not only for the extension C/Q, but also for questions in algebraic 
geometry. Fixing a field K with an algebraic closure K, the elements of the 
polynomial ring K [X1,...,X,] can be viewed as (polynomial) functions on K”; 
cf. 3.9. Likewise, the elements of the “function” field K(X1,...,Xn) give rise to 
rational “functions” on K”. Indeed, given h € K(X1,...,Xn), say h = f/g with 


fig € K[M,...,Xn], 9 4 0, and looking at points z € K”, where g(z) # 0, the 
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fraction h(z) = f(z)/g(z) is well defined in K. More generally, we can interpret 
every finitely generated extension field L = K(21,...,2,) of K in a similar way 
as a field of rational functions. Just consider the subring A = K[21,...,%] C L 
and use a representation A ~ K[X1,...,X,]/p of A as the residue class ring 
of a polynomial ring modulo a prime ideal p. As shown in 3.9, the elements of 
A can be viewed as polynomial functions on the zero set V(p) C K” of the 
ideal p, and likewise, the elements of L = Q(A) as rational functions on V (p). 
On the other hand, we will generalize in Section 7.3 the notions separable as 
well as purely inseparable, or primary as we prefer to say, from algebraic to 
more general field extensions. In particular, we will make plausible that these 
properties correspond to specific geometric properties of the attached algebraic 
sets V(p); see the end of Section 7.3 and its Exercise 4. 

To handle separable and primary field extensions in 7.3, we must be famil- 
iar with tensor products. Such products have already been studied in 4.11 in 
a quite restricted setting. However, now it is necessary to develop the corre- 
sponding basics in more generality; we do this in 7.2. Finally, in Section 7.4, we 
give a characterization of separable field extensions in terms of methods from 
differential calculus. 


7.1 Transcendence Bases 


In 2.5/6 we introduced the notion of algebraic independence, also referred to 
as transcendence, for finite systems of elements of a ring R’, relative to a ring 
extension R C R’. Restricting ourselves to fields, let us recall the definition once 
again. 


Definition 1. Let L/K be a field extension. A system (x1,...,2p) of elements in 
L is called algebraically independent or transcendental over K if every equation 
f(@1,--.;%n) = 0 for a polynomial f € K[X,...,Xn] implies f = 0, i.e., if 
the substitution homomorphism 


ni? 


K[X1.2-;Xn] FL, Yo Cag XTt XP Yo ty tt 0 


is injective. 

A system ¥ = (ai)ier of (arbitrarily many) elements of L is called alge- 
braically independent or transcendental over K if every finite subsystem of X 
is algebraically independent over K in the above sense. 


Thus, if X = (a;)ier is a system of elements in L that is algebraically 
independent over K, we can view the 2; as variables over Kk. In particular, 
the field K(X) generated by X over K equals the rational function field in 
the variables x;, 7 € I, which by its definition is the field of fractions of the 
polynomial ring A[X]. The extension L/K is called purely transcendental if 
L= K() for a system X that is algebraically independent over K. 
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Definition 2. Let L/K be a field extension and X an algebraically independent 
system of elements in L. Then X is called a transcendence basis of L/K if L 
is algebraic over K(X). 


Proposition 3. Let L/K be a field extension. A system X of elements in L is 
a transcendence basis of L/K if and only if X is a maximal system in L that 
is algebraically independent over K. In particular, every field extension L/K 
admits a transcendence basis. 


Proof. First assume that X is a maximal algebraically independent system of 
L/K. Then, by the maximality of ¥, every element of L is algebraic over K(X), 
so that X is a transcendence basis of L/K. Indeed, for « € L, the system 
obtained from X by adding z is no longer algebraically independent over K. 
Thus, there exists a finite subsystem (x1,...,%,) of ¥, as well as a nontrivial 
polynomial f € A[X1,...,Xn4i] satisfying f(x1,...,%,2) = 0. Since the 
elements 21,...,2, are algebraically independent over K’, we see that f, as a 
polynomial in X,,41, is of degree > 0. However, this means that x is algebraic 
over K(x1,...,%n) and hence over K(X). In particular, L is algebraic over K(X), 
and & is a transcendence basis of L/. On the other hand, every extension L/K 
admits a maximal algebraically independent system by Zorn’s lemma 3.4/5. 
Hence, it admits a transcendence basis. 

Conversely, assume that X is algebraically independent over K and that 
L/K(%) is algebraic. Then X is necessarily a maximal algebraically independent 
system of L/K. 


Next we want to show that an algebraically independent system of L/K can 
always be enlarged to a transcendence basis by adding suitable elements from 
a system generating L/K. Such an “exchange” argument will be used to show 
that any two transcendence bases of L/K are of same cardinality. 


Lemma 4. Consider a field extension L/K and a system XY) of elements in L 
such that L is algebraic over K ()). Furthermore, let X' C L be a system that is 
algebraically independent over K. Then X' can be enlarged by adding elements 
of X to yield a transcendence basis ¥ of L/K. 


In particular, there exists a subsystem in Q) forming a transcendence basis 
of L/K. 


Proof. Using Zorn’s lemma 3.4/5, we choose a maximal subsystem X” C Q) such 
that the composite system X = X’U X” is algebraically independent over K. 
Similarly as in the proof of Proposition 3, it follows that every element y € ) 
is algebraic over A(X). Then K(X, )) is algebraic over K(X), and the same is 
true for L/ K(X). In particular, X is a transcendence basis of L/K. 


Theorem 5. Every two transcendence bases of a field extension L/K have the 
same cardinality. 
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Before turning to the proof of the theorem, let us briefly explain how to 
compare cardinalities of sets; we will prove two auxiliary results on this matter 
below. However, since it is not really necessary for our purposes, we will not 
touch the formal definition of cardinalities using ordinals; instead, we refer to 
books on set theory for this. Up to now we have simplified things by looking 
at the order ord M of a set M from a naive point of view. For example, so far, 
ord M = oo has always indicated the fact that M consists of infinitely many 
elements, in the sense that it is not finite. However, when dealing with cardinal- 
ities one differentiates between different degrees of infinity. Two sets M@ and N 
are said to be of same cardinality, denoted by card M = card N, if there exists 
a bijection M@ —> N. Alternatively, we will use the notation card M < card N 
if there exists an injection M << N, or equivalently for M #4 9, a surjection 
N —> M. However, that a chain card M < card N < card M already implies 
card M = card N is far from being obvious in the case of nonfinite cardinalities; 
it is the assertion of the Schréder—Bernstein theorem that we will prove below. 
As usual, card M = n (resp. card M < n) for a natural number n means that 
M consists of precisely (resp. at most) n elements. 


Lemma 6 (Schréder—Bernstein theorem). Assume for two sets M and N that 
there are injections a: M > N and t: N — M. Then there is a bijection 
p: MON. 


Proof. We denote by M' Cc M the set of all elements « € M satisfying for every 
n € N the implication 


xe(too)"(M) = «€(rToa)"or(N). 


In other words, an element x € M belongs to M’ if and only if on taking repeated 
preimages of type x, T71(x), o-!771(x), t7107!7 H(z), ..., this yields either 
a chain of infinite length, or one that is finite and ends at an element in N. 


Hence, we can define a map p: M —>+ N by 


_ Jota) forze M’, 
lz) = ee for x € M’. 


Then p is injective, since the restrictions p|yy and p|jy_j are injective, and 
since p(x) = p(y) for  € M’ and y € M—M’ implies o(y) = 7~1(x), hence 
Too(y) = « € M’. However, this yields y € M’, contradicting the choice of 
y. Furthermore, we claim that p is surjective as well. Indeed, fix z € N and 
look at x = T(z). If  € M’, we have p(x) = 7~!(x) = z. On the other hand, 
if « ¢ M’, we conclude that z admits a preimage y = 0 '(z) = a '(771(z)), 
since otherwise, « would belong to M’. Since x ¢ M’, we get y ¢ M’ as well, 
and hence p(y) = a(y) = z. 


Lemma 7. Every infinite set M is a disjoint union of sets that are countably 
infinite. 
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Proof. Consider the set X of all pairs (A, Z), where A is an infinite subset of M, 
and where Z is a disjoint decomposition of A into countably infinite subsets. 
In other words, Z is a system of countably infinite disjoint subsets of A whose 
union equals A. Since M is infinite, we get X # Q. Furthermore, we write 
(A, Z) < (A’,Z’) for two elements of X if A is contained in A’ and if Z is a 
subsystem of Z’. In this way, we have defined a partial order on X, and it is 
immediately clear that every totally ordered subset of X admits an upper bound 
in X. Therefore, X admits a maximal element (A, Z) by Zorn’s lemma 3.4/5. 
Now observe that the difference M — A is finite, due to the maximality of 
(A, Z). But then, enlarging an arbitrary element of the decomposition Z by the 
set M — A, we obtain a decomposition of M into disjoint countably infinite 
subsets, as desired. 


Now we are able to carry out the proof of Theorem 5. Let X and Q) be two 
transcendence bases of L/K, where for the purposes of our proof, both systems 
are viewed as subsets of L. First, considering the case that X is finite, say 
X = {x1,..., Up}, let us show that card Y) < card ¥ by induction on n = card X. 
Then we get card 2) = card X by reasons of symmetry. The base case n = 0 is 
trivial, since L/K is algebraic then. Therefore assume n > 0. In this case, 
L/K is no longer algebraic, and consequently, 2) is nonempty. Therefore, there 
exists an element y € 2). Applying Lemma 4, we can enlarge the system {y} 
by adding elements of 2% in order to construct a transcendence basis 3 of L/K. 
Then, necessarily, card 3 < n, since X, as a maximal algebraically independent 
system of L/K, cannot be contained in 3 together with y. Next observe that ) 
and 3 both contain y. Therefore, 2) — {y} and 3 — {y} are two transcendence 
bases of L over K(y). Since card(3 — {y}) < card3 < n, we conclude from 
the induction hypothesis that card(2) — {y}) < card(3 — {y}) and hence that 
card) < card 3 <n = card &. 


The argument we have just given shows that two transcendence bases X 
and 2) of L/K are either finite, and then of equal cardinality, or both infinite. 
Thus, it remains to look at the case that both X and Q) are infinite. To do 
this, look at any element x € X. Since it is algebraic over K(Q)), there exists 
a finite subset 2), C Y such that x is algebraic already over K(),,). Since 
L cannot be algebraic over K()’) for any proper subsystem 2)’ © QJ, we get 
U,ex Ya = Y. Therefore, we can use the inclusions Y), <> Y) to define a sur- 
jective map [],.<xYc —> Y from the disjoint union of all Y), to Y. Then we 
conclude that card Q) < card([],-z Ya) and even card Q) < card X if we are able 
to show that card([],<y Vx) = card. If X is countably infinite, this equal- 
ity can easily be verified by means of a simple counting argument. The same 
argument can be used in the general case, where we can apply Lemma 7 and 
write X as a disjoint union of countably infinite subsets. After all this, we get 
card 2) < card X, and by reasons of symmetry, also card X < card Y). This im- 
plies card X = card Q) using Lemma 6. 
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The result we have just proved allows us to define for an arbitrary field ex- 
tension L/K its transcendence degree transdeg,L as the cardinality of a tran- 
scendence basis of L/ Kk. Algebraic extensions are always of transcendence degree 
0, while for a polynomial ring K [X ,..., X,] over a field K, its field of fractions 
K(X,...,Xn) is a purely transcendental extension of transcendence degree n 
over K. More generally, for an arbitrary system of variables X, the field of frac- 
tions K(X) of the polynomial ring K[X] is a purely transcendental extension of 
transcendence degree card X over Kk. Since K-isomorphisms map transcendence 
bases into bases of the same type, we have the following corollary: 


Corollary 8. Two purely transcendental field extensions L/K and L'/K admit 
a K-isomorphism L + L’ if and only if L and L’ are of the same transcen- 
dence degree over K. 


In particular, this implies that there cannot exist a K-isomorphism between 
polynomial rings K[X1,...,Xm] and K[Y1,...,Y,] having different numbers 
of variables m and n. Otherwise, the corresponding fields of fractions would be 
isomorphic over K’, although they would have different transcendence degrees, 
which is impossible. We want to exploit the same argument in more detail. 


Corollary 9. Let yp: K[.X1,...,Xm] —> K[M%,..., Yn] be a K-homomorphism 
between polynomial rings such that every variable Y € {Y1,...,Y,} satisfies an 
equation of type 


Ytey 2... +2 =o C1,---,Cr € imy, 


i.e., such that vy is integral in the terminology of Section 3.3; cf. 3.3/4. Then, 
necessarily m > n. Furthermore, ip is injective if and only if m =n. 


Proof. Let R be the image of y. Being a subring of K[Y1,...,Y,], it is an in- 


tegral domain, and we can view K(Yj,...,Y;,) as an extension field of the field 
of fractions Q(R). Since K(¥7,...,Y¥n) = Q(R)(%4,...,Y;,) and since the ele- 
ments Y;,..., Y, are algebraic over Q(R) by our assumption, it follows that the 


extension K(Y1,...,¥;n)/Q(R) is algebraic. Therefore, Q(R) and K(Y,..., Yn) 
are of the same transcendence degree over K, in fact of transcendence degree 
n. On the other hand, since the variables Xj,...,X,, give rise to elements 
L1,-.-,%m € Q(R) generating the extension Q(R)/K, we conclude that m >n 
by Lemma 4. 

Furthermore, ¢ is injective if and only if the elements x,,..., 2% € Q(R) are 
algebraically independent over K, i.e., if and only if transdeg,Q(R) = m. Since 
we know already that transdeg,Q(R) =n, the injectivity of y is equivalent to 
m =n, as claimed. 


Considering a chain of field extensions K Cc L C M, let us add that the 
transcendence degree behaves additively: 


transdeg, M = transdeg, L + transdeg,; M. 
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Indeed, fixing transcendence bases X of L/K and Q) of M/L, it is easily verified 
that XU) is a transcendence basis of //K. Furthermore, observe that the sum 
of the cardinalities of ¥ and Y) is by definition the cardinality of the (disjoint) 
union X U Q). 

Finally, let us show for a purely transcendental extension L/K that the alge- 
braic closure of K in L coincides with K, in other words, that K is algebraically 
closed in L. 


Remark 10. Let L/K be a purely transcendental field extension. Then every 
element x € L— K is transcendental over K. 


Proof. Consider an element « € L that is algebraic over AK’, and choose a tran- 
scendence basis ¥ of L/K such that L = K(X). Then there is a finite subsystem 
(a1,...,@,) of X such that x € K(2,...,2,-). Therefore, to show that x € K, 
it is enough to consider the case that X = (a1,...,2,) and hence is finite. Now 
let 

f=X™ 4X1 +...4+q € K[X] 


be the minimal polynomial of x € L = K(X) over A’, where we may assume 
x # 0 and hence c, # 0. Interpreting AK [X] as a polynomial ring in the vari- 
ables 21,...,2,, we conclude from 2.7/3 that AK[%] is a unique factorization 
domain. Therefore, we can write x as a reduced fraction, say x = g/h, where 
the polynomials g,h € K[X] are relatively prime. Then the equation f(x) = 0 
yields 

g tag” ‘h+...4+¢,h" =0, 


and we see that every prime element g € K[X] dividing h will divide g as well. 
However, this implies that h is a unit in K[X] and hence that h € K™. In the 
same way, we conclude that g € A*, thus showing that x € K. 


Exercises 


1. Compare the notion of a transcendence basis of a field extension L/K with the 
notion of a basis of a vector space. 


2. Show that there exist automorphisms of C that do not map R to itself, and fur- 
thermore, that C contains proper subfields that are isomorphic to C. 


3. Show that the transcendence degree of R/Q equals the cardinality of R. 


4. Show that every field of characteristic 0 is a union of subfields that are isomorphic 
to subfields of C. 


5. Let L/K bea field extension and X an algebraically independent system of L/K. 
Show for every intermediate field K' of L/K that is algebraic over K that X is 
algebraically independent over K’. 


6. Let L/K bea finitely generated field extension. Show for every intermediate field 
L’' of L/K that the extension L'/K is finitely generated. 
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In Section 4.11 we introduced a simplified version of tensor products in order 
to put the process of coefficient extension for vector spaces onto a solid basis. 
Now we want to study tensor products from a more general viewpoint, having 
in mind applications to separable and primary field extensions in Section 7.3. 
We start with the tensor product of modules over a ring; see Section 2.9 for the 
definition of modules. 

In the following we fix two modules M, N over a ring R. If E is yet another 
R-module, a map @: M x N —> E is called R-bilinear, as usual, if for all 
x€M,y€N the maps 


P(a,-): N— EB, zr &(a, z), 
&(.,y): ME, zr &(z,y), 


are R-linear, i.e., are homomorphisms of R-modules. The main point of a tensor 
product of M and N over R is the fact that it is an R-module T such that the 
R-bilinear maps of M x N to an arbitrary R-module FE correspond bijectively 
to the R-linear maps T —> E. More precisely: 


Definition 1. A tensor product of two R-modules M and N over a ring R 
is gwen by an R-module T together with an R-bilinear map 7: M x N —> T, 
admitting the following universal property: 

For every R-bilinear map 6: M x N —> E to an R-module E, there is a 
unique R-linear map y: T — E such that ® = por, in other words, such that 
the diagram 


MxN—-+T 


1A 


E 


as commutative. 


By their defining universal property, tensor products are unique, up to 
canonical isomorphism. They always exist, as we will immediately see below. 
Furthermore, observe that one usually writes MZ @pr N instead of T in the sit- 
uation of Definition 1. Also, given elements x € M and y € N, it is common 
practice to write the image of (a, y) under the R-bilinear map 7: M x N —> T 
in product notation as x®y; elements of type r@y are called tensors in M@rN. 
Using such a notation, the R-bilinear map 7 is characterized by 


Mx N—>M @eN, (x,y) G9 r@y. 
In particular, tensors are R-bilinear in their factors, which means that 


(ax + a'a") ® (by + b’y’) 
=ab(r @y) + ab’ (rx @y') +a dia’ @y) +a (a @y’) 
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for a,a',b,b' € R, v,2' © M, y,y' € N. In many cases the defining R-bilinear 
map T: M x N —> M ®, N is not mentioned explicitly. Then one talks about 
M ®r N as being the tensor product of WM and N over R, assuming that the 
tensors « ® y in M @R N are known. Indeed, knowing all tensors in M @p N, 
the map 7 can be reconstructed. 


Proposition 2. The tensor product T= M®RN exists for arbitrary R-modules 
M and N. 


Proof. The basic construction idea is quite simple. We start with R(™*%), the 
free R-module generated by all pairs (x, y) € Mx N, and divide out the smallest 
submodule Q such that the residue classes of elements of type (x, y) acquire the 
properties of tensors.' This means that we consider the submodule Q Cc RM*%) 
that is generated by all elements 


(eta) Gy) — ea: 
(z,yt+y')—(z,y)— (z,y), 
(ax, y) — a(x, y), 

(x, ay) — a(z,y), 


where a € R, 2,2’ € M, y,y' € N. Then, writing T = R(™*) /Q, the canonical 
map T: M x N —+ T assigning to a pair (x,y) the residue class of (x,y) in 
T is R-bilinear. We want to show that 7 satisfies the universal property of a 
tensor product as stated in Definition 1. Therefore, consider an R-bilinear map 
}: Mx N —> E to some R-module EF. It gives rise canonically to an R-linear 
map ¢: R“*N) —+ EB, by setting (x,y) = (x,y) for basis elements of type 
(x,y) € R™*) and using R-linear extension. Then one concludes from the 
R-bilinearity of @ that ker ~ contains all generating elements of Q as listed 
above, hence that ¢ induces an R-linear map y: R™“*%)/Q —> E satisfying 
@ = yor. The map ¢ is uniquely determined by the relation 6 = yor, since the 
residue classes (2, y) of the basis elements (2, y) € R(*%) generate R™“*%) /Q 
as an R-module and since 


y((x,y)) = ¢(r(z,y)) = O(a, y). 


In particular, y is unique on a generating system of T = R™!*%)/Q and hence 
unique on all of 7. 


Working with tensor products, their explicit construction as given in the 
proof of Proposition 2 is of only minor interest. In almost all cases it is more 
efficient and more appropriate to derive the desired properties from the defin- 
ing universal property of tensor products. For example, we can read from the 
construction of M ®pr N that each element z € M @pR N can be written as a 


| Here (x, y) indicates the element (T'myn)meM,neN in R™™*) that in terms of Kronecker’s 
symbol is given by mn = dm,xOn,y- 
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finite sum of tensors, say z = )>;"_, 2; ® y;. However, this fact is easily derived 
from the universal property of M ®p N as well, since the submodule of M ®p N 
that is generated by all tensors in M @p N satisfies the universal property of a 
tensor product of M and N over R, just as does M @z N itself. Concerning the 
notation of tensors, a bit of care is recommended. Indeed, for any tensor x ® y 
its ambient tensor product M ®pr N where this tensor is built has to be spec- 
ified, unless this is clear from the context. The reason is that for a submodule 
M' C M the tensor product M'@zN is not necessarily a submodule of M@rN. 
In general, there can exist nonzero tensors x @ y in M’ ®p N that vanish as 
tensors in M ®p N. For example, consider the tensor 2 ® 1 in (2Z) @z (Z/2Z), 
as well as in Z ®z (Z/2Z); we will have a closer look at this example later on. 

In many cases it is convenient to describe an R-linear map M @r N — E 
from a tensor product to an R-module F by specifying the images in F of all 
tensors 7 ® y € M @r N, since as we know, these generate M ®p N over R. 
However, when proceeding like this, the images of the tensors of IM ®p N cannot 
be prescribed arbitrarily, since they must respect the rules of R-bilinearity. 
Given a family (Z2,)ceayen Of elements in EL, there exists an R-linear map 
M®rN — Ewith r®y+-> 2,,, precisely in those cases in which (a, y) +> Zz,y 
defines an R-bilinear map M x N —> E. 


Remark 3. For R-modules M, N, P, there exist canonical isomorphisms 


R@rM—™~M, aQxre > az, 
MS@rpNON Oe M, LOy ya, 
(M@rN)@rPS>M @e(N@rP), (Cy) @z+-Fe@ (y@z), 


which are uniquely characterized by the stated mapping rules on tensors. 


Proof. In all three cases, one proceeds in a similar way. First one shows that the 
mapping rule given on tensors leads to a well-defined R-linear map, and then 
one constructs a natural inverse of this map. As an example, let us look at the 
first isomorphism. Since the map R x M —> M, (a,x) -> az, is R-bilinear, 
it gives rise to a well-defined R-linear map y: R@r M — M,a®ax+> ax. 
To construct an inverse of it, look at the R-linear map ww: M — R®p M, 
xe + 1@a. It satisfies yo w(a) = x for all e € M, and in addition, 
poyv(a®x) =v(ar) =18ar=a8« for all tensors a @ a in R@p M. This 
implies yow = id and Woy = id. Therefore, vy is an isomorphism, and y~! = 1. 


In a similar way one proves the following: 


Remark 4. Let (M;)jcr be a family of R-modules, and N another R-module. 


Then there is a canonical isomorphism 


(BM) RN DPMenN), (ier @y (wi BV)ier, 


tel ier 
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which is uniquely characterized by the stated mapping rule on tensors. In par- 
ticular, tensor products are compatible with direct sums. 


Next we define for two given R-linear maps y: M —> M’ andy: N —> N' 
their tensor product p@W: M@rzN — M'@RN' by r@y+> v(x) @ W(y). 
This is permissible, since (x,y) > y(x) ® ~(y) gives rise to an R-bilinear 
map M x N —> M’ @r N’. In particular, we can consider the tensor product 
p@id: M@pzpN —+ M'@pRN of ¢ with the identical map on N; for simplicity, 
it is said that one constructs the tensor product of the map y: M —> M’ with 
the R-module N. To study the behavior of R-linear maps when taking tensor 
products with an R-module N, we use the notion of exact sequence. Hereby one 
understands a sequence of R-linear maps 


My, “2s by S455 SM 


such that im y; = ker y;+1 for 7 = 1,...,r — 2. 


Proposition 5. Let 
M’-*, Ms M” 0 


be an exact sequence of R-modules. Then, for every R-module N, the sequence 


M @npN “4. Megn 22 M’@pN —> 0 


as exact. 


Proof. First observe that (q @ id) o (y @ id) = (Woy) @id = 0, since Voy = 0, 
which shows that im(py ® id) C ker(q @ id). Therefore, w @ id gives rise to an 
R-linear map 

wb: (M ®p N)/im(y @ id) — M" @z N, 


and it is enough to show that ~ is an isomorphism. To construct an inverse of 7 
we use the surjectivity of ~ and choose for every element x2” € M” an element 
ua”) € M such that (v(v")) = x”; the resulting map 1: M” —> M is meant 
as a map between sets only. Now consider the map 


a: M"x N—+(M®@pr N)/im(y ® id), (x,y) u(x") @ y, 


where 1(2”) ® y indicates the residue class of u(x”) @y in (M @r N)/ im(yp@ id). 
We claim that o is an R-bilinear map, the linearity in the second argument being 
clear. To justify the linearity in the first argument, it is enough to show that 
the element (x’) ® y is independent of the choice of the preimage s(x”) € M of 
xv” € M". To check such an independence, consider two preimages x1, 2% € M 
of x”. Then we have x1 — 2 € imy, say x; — x = y(a’), since the sequence 
M' > M > M" > 0 is exact. However, this implies 


r1@y—-wM2By=Y(2') @y=(y@id)(ax’ ®y) =0, 
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as claimed. Therefore, the above map o is R-bilinear, and it is seen that the 
induced R-linear map M” @g N —> (M ®p N)/im(y @ id) is an inverse of @. 


Given R-modules M, N, and a submodule M’ Cc M, we can use Proposi- 
tion 5 to make tensor products of type (M/M') @z N more explicit. From the 
canonical exact sequence 


M’ > M + M/M' —0 


we get the exact sequence 


M'@n2N 22", M@zN 22 (M/M) @2N —> 0, 


which yields an isomorphism 
(M/M') @r N 4 (M @z N)/im(y @ id), TQyH>c@y. 


However, it should be observed that even if y is injective, this does not neces- 
sarily imply injectivity for the tensor product map y ®id. As a consequence, we 
cannot generally use y @ id to view M' @p N as a submodule of M @z N. For 
example, look at the inclusion 2Z ~ Z and consider its tensor product with 
Z/2Z over Z, i.e., the induced map 2Z ®z (Z/2Z) —> Z ®z (Z/2Z). This is 
zero, since in Z ®z (Z/2Z), all tensors of type 2a @ b can be written as a @ 2b 
and hence are zero. On the other hand, 2Z @z (Z/2Z) ~ Z ®z (Z/2Z) ~ Z/2Z 
is nonzero. 

An R-module N is called flat if for every injection of R-modules M' 4 M 
the tensor product map M’@rpN —> M@ RN is injective as well. This is equiv- 
alent to the condition that exact sequences of type 0 + M’—> M—> M" 50 
remain exact under taking the tensor product with N over R. For example, we 
can conclude the following from Remarks 3 and 4: 


Remark 6. Free R-modules are flat. In particular, every vector space over a 


field is flat. 


Next we want to explain the process of coefficient extension for R-modules. 
Let f: R —> R’ be a ring homomorphism and view R’ as an R-module with 
respect to f. Then we can construct for any R-module M the tensor product 
M ®p R’, which, by its definition, is an R-module. We claim that the R-module 
structure extends canonically to an R’-module structure. Indeed, define the 
product of an element a € R’ with a tensor (2 ® b) € M @p R’ by x @ (ab). To 
see that we get a well-defined multiplication by a on M ®p R’, observe that the 
map 

MxR>MeprR, (a,b) > x @ (ab), 


is R-bilinear and hence gives rise to an R-linear map 


M rR >MERR, xt ®@br+x® (ab). 
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Furthermore, using the calculation rules of tensors, it is immediately clear that 
this multiplication satisfies the conditions necessary for an R’-module structure 
on M @R R’. We say that the R’-module M @R R’ is obtained from M by 
coefficient extension. Also it is easily seen that this definition of coefficient 
extension coincides with the one given in Section 4.11, where we restricted 
ourselves to vector spaces over fields; see also Exercise 1 below. 


Remark 7. Let R— R! —> R" be ring homomorphisms, and M an R-module. 
Then there is a canonical isomorphism of R’-modules 


(M @pz R') @p RY SM Op PR", (x @a’) @a" ++ 27 @ (a’a"), 
which is uniquely characterized by the stated mapping rule on tensors. 


Proof. Viewing R’ —> R" as an R-linear map, it induces an R’-linear map 
oa: MRR — M @p FR" by taking the tensor product with M. Furthermore, 


(M @p R’) x R"” + M @z R", (rz, a") a" - a(x), 


is a well-defined R’-bilinear map and therefore gives rise to an R’-linear map 
(M @p R') @px R" — M &ez R", the one considered in the assertion. This map 
is R”-linear, as is easily checked on tensors. To construct an inverse of it, look 
at the R-bilinear map 


M x R" —> (M @e R’) @px R", (z,a")-+ (x @1) @a", 


as well as at the corresponding map M ®@p R"” —>+ (M @pr R') @R R". 


We will use the process of coefficient extension especially for ring homomor- 
phisms of type R —> Rg, where S$ C Ris a multiplicative system; here Rg is the 
localization of R by S, i.c., Rg = S~'R in the notation of Section 2.7. Therefore, 
given an R-module M, we can always consider M ®pr Rg as an Rs-module. On 
the other hand, we can just as well construct from M an Rs-module by applying 
the process of localization to modules instead of rings. Indeed, consider the set 
of all fractions = for  € M, s € S, and identify = in each case with another 
fraction & if there exists an element s” € S such that s"(s'a~ — sx’) = 0. Then, 
using the usual rules of fractional arithmetic, the resulting set is an Rgs-module. 
It is denoted by Ms. 


Proposition 8. Let SC R be a multiplicative system. 
(i) The canonical map R —> Rg is flat, i.e., Rg is a flat R-module under 
this map. 
(ii) For every R-module M, there is a canonical isomorphism of R-modules, 
resp. Rs-modules, 
a ax 
M@rpRsg ~ Ms, Lre®- +o 
Ss cS 


? 


which is uniquely characterized by the stated mapping rule on tensors. 
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Proof. We start with assertion (ii). The map 
M x Rs —> Ms, («, *) pt} 
s 


is well defined, as is easily checked, and R-bilinear. Hence, it gives rise to a 
unique R-linear map y: M @r Rs —> Mg with mapping rule as stated in (ii). 
In addition, the mapping rule shows that y is even Rg-linear. On the other 
hand, it is easily checked that 


1 
wb): Ms — M @p Bs, => 28>, 
Ss 7 


is a well-defined R-linear map that is an inverse of y. Consequently, y is an 
isomorphism. 

Now assertion (i) is easy to obtain. Let a: M’ —> M be an injection of 
R-modules. Then, applying (ii), it is enough to show that the natural map 
og: Mg —> Mg, = +> 2) induced from a, is injective. Therefore, look at an 
element = in Mg whose image in Msg is zero. By the definition of Mg, there is 
an element s” € S such that o(s”x) = s”o(x) = 0. However, the injectivity of 
o implies sx = 0 and therefore = = 0, i.e., og is injective. 


Finally, fixing two ring homomorphisms f: R —> R’ and g: R —> R", 
let us look at the tensor product R! @p R”. In order to avoid mentioning the 
homomorphisms f and g explicitly, we prefer to view R’ and R” as R-algebras; 
see Section 3.3. Observe that R’ @p R” is from the left an R’/-module and from 
the right an R”-module. Furthermore, we want to show that R’ @p R” is even 
an R-algebra. For this we introduce on R’ ®z R” a ring multiplication via the 
rule 


(a @ b) - (c@ d) = (ac) ® (bd). 


To check that we get a well-defined multiplication on R’ @p R”, consider for an 
arbitrary element z = )>;_, ci @ d; € R’ @p R" the map 


r 


R' x R" — R' @p R", (a,b) 4 a-z-b= S-(aci) ® (bd;). 


i=1 


This map is well defined and R-bilinear, since we know already that R’ @p R” 
is an R’-module, as well as an R”-module. Therefore, we obtain the “multipli- 
cation” by z as an R-linear map 


R' @p R" —> R' Op R", a®@br+a-z-b. 
Varying z over R’ @p R", we get a map 
(R’ Qr R") x (R’ Qr R") at R' Qr R", 


which is characterized by the rule 
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(a @ b,c @ d) +> (ac) ® (bd). 


In addition, it admits the properties of a ring multiplication, due to the bilin- 
earity property of tensors. Then, to equip the tensor product R’ @p R” with 
the structure of an R-algebra, we use the ring homomorphism R —> R’ @p R” 
given by a+> (a-1)@1=1@(a-1). 

The tensor product R’ @p R” of two R-algebras R’ and R” comes equipped 
with two canonical R-algebra homomorphisms: 


a’: R' —+ R' @p R", at >a’ @l, 
o": R" —+ R' @p R", a’ —+ 1 wa". 


Together, they provide a unique characterization of the tensor product R’@z R” 
as an R-algebra, as we will see in the next lemma. 


Lemma 9. The above maps o': R’ — R' @R R", o”: R" — R' @p R' 
admit the following universal property: Given two R-algebra homomorphisms 
vy’: R' — A and yp": R" — A to an R-algebra A, there is a unique R-algebra 
homomorphism yp: R' @p R” —+ A such that the diagram 


R’ 


is commutative. Furthermore, y is characterized by a’ @ a" + y'(a’)- y"(a"). 

If the homomorphisms v', p" admit the same universal property as do o',a", 
then ~ is an isomorphism. In particular, the tensor product R' ®p R” as an 
R-algebra is uniquely characterized by the above universal property. 


Proof. To show that y is unique, look at a tensor a’ @ a” € R’ @p R". Then we 
have 


g(a’ ®a") = p((a' ®1)-(1@a")) =9(a' @1)- (1 Ba") = ya’) - y"(a"). 


Therefore, y is unique on all tensors in R’ @p R" and hence on all of R’ @z R". 
On the other hand, we can consider the map 


R' x R" — A, (a’, a") a y'(a’) 3 yp" (a"). 


It is R-bilinear and hence induces an R-linear map y: R’ @p R” —> A. That 
y is an R-algebra homomorphism with the required properties can readily be 
checked. 
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Proposition 10. Let R’ be an R-algebra and X a system of variables, as well 
as a C R[X] an ideal. Then there are canonical isomorphisms 
R[X] @z R'  R'[X], fedrrayf, 
(R[X]/a) @p RS RX] /oR'[X], Fear va, 


which are uniquely characterized by the stated mapping rules on tensors. 


Proof. The canonical R-algebra homomorphisms y’: R[X] —> R'[X] and 
yp": R’ —>+ R’[X] give rise, due to Lemma 9, to an R-algebra homomorphism 


p: R[X] @r R' — R'[X], fedroaf. 


On the other hand, R’ —> R[X] @pr R’, a’ —> 1 @ a’, can be extended via 
X ++ ¥X @ 1 to yield a ring homomorphism ~: R'[X] —> R[X] @r R’; cf. 
2.5/5. One can check without problems that 7 and y are inverse to each other. 
Thus, y is an isomorphism, as claimed. The second isomorphism mentioned in 
the assertion follows from the first one with the help of Proposition 5. 


To end the present section, we want to apply the preceding results on tensor 
products of algebras to the special case of field extensions. In more precise terms, 
we want to determine the tensor product L ®x K’ for certain field extensions 
L/K and K'/K. First of all, let us point out that L®xK’ is a nonzero K-algebra 
containing L and K’ as subalgebras. Indeed, the canonical maps 


L~L@x K SL, K', K'~K @x K' 4 L®x K' 
are injective, due to the flatness of L/K and K'/K. 


Remark 11. Let K'/K be a field extension and f € K[X] a polynomial in 
one variable X. Then 


(K[X]/fK[X]) @x« K' x K'[X]/fK'(X]. 
Furthermore, if f = pi! ... pk" ts a factorization in K'|X] with pairwise nonas- 


sociated prime polynomials p; € K'LX], we get 


(KIXV/PKUX]) @x K! ~ TT K'LXI/oP KX). 


i=1 


Proof. Use Proposition 10 in conjunction with the Chinese remainder theo- 
rem 2.4/14. 


If L/K is a simple algebraic field extension, say L = K(a) with minimal 
polynomial f € KX] of a, and if K’/K is any field extension, then we see in 
the setting of Remark 11 that 


7.2 Tensor Products* 299 


K(a) @x K’ ~ K'[X]/fK'[X] = TL 'tX1/oh #1] 


is a field if and only if f is irreducible over Kk’. However, in general K(a) @x K’ 
will contain nontrivial zero divisors and even nontrivial nilpotent elements, i.e., 
elements z 4 0 such that there is an exponent n € N satisfying 2” = 0. Indeed, 
K(a) ®x K’ contains such nilpotent elements if and only if at least one of the 
above exponents v; is greater than 1. In particular, f cannot be separable then. 


Remark 12. Let L/K be a purely transcendental field extension and X a tran- 
scendence basis generating L over K, i.e., such that L = K(X). Then, for every 
field extension K'/K, there are canonical homomorphisms 


L@x K's K'[¥]s O K'(X), 

where K'|X]g is the localization of the polynomial ring K'|X] by the multiplica- 
tive system S = K[X]—{O0}. In particular, L @x K’' is an integral domain. 
Proof. We view L as the localization K[X]g of the polynomial ring A[X] by 
the multiplicative system S = K[X]—{0}. Then Proposition 10 yields 

K(X] @x K' ~ K'[X] 
and, in conjunction with Remark 7 and Proposition 8, 

L@x K'~ K[X]s @x K' 
~ K[¥]s @xtxj (K[X] @x K’) 


~ K[X]s @xpxy K(X] 
~ K'T¥]s. 


Therefore, L @x K' is a subring of the field of fractions of K’[X], and as such, 
an integral domain. 


We want to explain another property that quite often allows one to reduce 
problems on tensor products to the case in which certain finiteness conditions 
are given. It concerns the compatibility of tensor products with direct limits, see 
Exercise 8 below, a property that we formulate for simplicity only in a special 
case at this place. 


Lemma 13. Let A and A’ be algebras over a field K and observe for subalgebras 
Ap C A and Aj C A’ that Ap @K AG is canonically a subalgebra in A @xK A’. 
Furthermore, let (Ai)ier, (Aj)jer be directed systems of subalgebras in A, resp. 
A’, such that A = Uje, Ai and A’ = Uses Aj. Then (Aj @x Al )ierjes is a 
directed system of subalgebras in A @x A’ such that 


A®Kx A= [J (A: ®x Aj). 


iel jet 
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(Recall that the system (Aj;)ier is called directed if for i,t’ € I there is always 
an index k € I such that A;U Ay C Ag, likewise for all remaining systems; see 
also 4.2.) 


Proof. By the flatness of K-algebras, the inclusions Ap A and Aj > A’ give 
rise to injections 
Ap ®x Ay @ Ap @x AO AKA’. 


In particular, the tensor products A; ®@x Aj; can canonically be viewed as subal- 
gebras of A@x A’, for alli € I, 7 € J. Now consider an element z € A®@x A’ and 
write it as a finite sum z = D1 _) Lp @Yp, where x, € Uj<; Ai and yp € Uses Aj. 
Since (Aj)icer and (Aj) ;<7 are directed, there are indices i € I, j € J such that 
X1,---,0, € Aj and y1,.--, Yr € Ay. Thus, we get 2 € A; @x Aj. 


As an example, we can consider for two field extensions L/K and L'/K the 
directed systems (L;)ier, resp. (Li) jez, of all subfields L; C L, resp. Li c L’, 
that are finitely generated over K’. Then we obtain L@x L’ = je, je (Li@x L;) 
from Lemma 13. Now, if we want to prove a certain property for L @x L’, for 
instance to be an integral domain, we see that L@x L’ does not contain nontrivial 
zero divisors if and only if all L; @x Li admit this property. In this way, we 
can often reduce considerations about general field extensions to the case of 
finitely generated field extensions. In the next section we will make use of this 
possibility on several occasions. 


Exercises 


Let R always be a ring. 


1. Consider the tensor product M pr R' of an R-module M and an R-algebra 
R’. Then show that M @p R’, as an R'-module together with the R-linear map 
tT: M > M @RR',r+> 2 @1, is uniquely characterized by the following uni- 
versal property: For every R-linear map ®: M —+ E to an R'-module E, there 
is a unique R'-linear map py: M ®p R' —> E such that ® = por. 


2. Prove the existence of the tensor product R! @p R" of two R-algebras in a direct 
way, by constructing an R-algebra T admitting the universal property of Lemma 9. 


3. Let R’ be an R-algebra. Show for a flat R-module M that M @p R’ is a flat 
R'-module. 


4. Let M be an R-module. Show: 
(i) If is flat and a € R is not a zero divisor in R, then every equation az = 0 

for « € M implies x = 0. 
(ii) If R is a principal ideal domain, then M is a flat R-module if and only if 


M is torsion-free, in the sense that an equation ax = 0 for elements a € R, 
x € M always implies a = 0 or x = 0. 


5. Consider two R-algebras R’, R”, as well as two ideals a’ C R’, a” C R", and show 
that (R’/a') @p(R"/a") ~ (R' @R R")/(a', a"). Here (a’, a’) indicates the ideal in 
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R' ®p R" that is generated by the images o/(a’) and o” (a) with respect to the 
canonical R-algebra homomorphisms o’: R’ —> R’ @p R", 0": R" — R'@RR". 


6. For a normal algebraic field extension L/K in characteristic p > 0, consider the 
separable closure K,, as well as the purely inseparable closure K; of K in L; cf. 
3.7/4 and 3.7/5. Show that the canonical map K, @x K; —> L, a®b+—> ab, is 
an isomorphism. 


7. Let L/K and K'/K be finitely generated field extensions, where L/K is tran- 
scendental of transcendence degree > 0. Show that L ®x K’ is a field if and only 
if the extension K’/K is algebraic. 


8. Let (M;)ier and (N;)ier be two inductive systems of R-modules (see Section 4.2). 
Show that (M; @pr Ni)ier is naturally an inductive system of R-modules, and that 
there is a canonical isomorphism of R-modules 


(lim Mj) ®p (lim Ni) > lim(M; ®r Ni). 


7.3 Separable, Primary, and Regular Extensions* 


In the present section we study certain classes of not necessarily algebraic field 
extensions whose defining properties can be characterized in terms of tensor 
products. We start with separable field extensions, recalling first of all the def- 
inition of the radical of a ring R. It is denoted by rad R and consists of all 
elements z € R such that there is an exponent n € N satisfying z” = 0. The 
ring R is called reduced if rad R = 0. 


Remark 1. The following conditions are equivalent for an algebraic field ex- 
tension L/K: 

(i) The extension L/K is separable in the sense of Definition 3.6/3. 

(ii) The tensor product L @x K' is reduced for every field extension K'/K. 


Proof. First, consider the case that L/K is separable. Then, applying 7.2/13, 
we may assume that the extension L/K is finitely generated and hence of finite 
degree. By the primitive element theorem 3.6/12, the extension L/K is simple, 
say L = K(a) for some element a € L, and it follows from 7.2/11 and the 
separability of a over K that L @x K"' is reduced for every field extension 
k'/K. This shows that (i) implies (ii). Conversely, assuming (ii), let A’ be 
an algebraic closure of K and fix an element a € L. Since K’/K is flat, the 
inclusion K(a) @ L leads to an inclusion K(a) @« K'’ L®x K’, and we see 
that K(a) @x K’ is reduced. But then the minimal polynomial of a over K can 
have only simple zeros in K’, due to 7.2/11, and it follows that a is separable 
over Ky. Letting a vary over L, it follows that L/K is separable. 


Since condition (ii) in Remark 1 is meaningful also for nonalgebraic ex- 
tensions L/K, we can define the separability for arbitrary field extensions as 
follows: 
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Definition 2. A field extension L/K is called separable if for arbitrary field 
extensions K'/K the tensor product L @x K' is reduced. 


Remark 3. Every purely transcendental field extension L/K is separable. 


Proof. Use 7.2/12. 


Next we want to list some simple properties of separable field extensions. 


Proposition 4. Let M/K be a field extension. 
(i) If M/K is separable and L is an intermediate field of M/K, then the 
extension L/K is separable as well.” 
(ii) M/K is separable if and only if L/K is separable for every intermediate 
field L of M/K that is finitely generated over K. 
(iii) If for an intermediate field L of M/K the extensions M/L and L/K 
are separable, then M/K is separable as well. 


Proof. Consider an intermediate field L of M/k and assume that M/K is 
separable. Then, for any field extension K’/K, the inclusion L + M gives rise 
to an inclusion L@x K' 4 M®x Kk", since K'/K is flat. In particular, L@x K’ 
is reduced if M ®x K’ is reduced. Thus, the separability of M/K implies that 
of L/Kk. Moreover, we conclude from 7.2/13 that M/IK is separable if and only 
if L/K is separable for all intermediate fields L that are finitely generated over 
kK. This verifies assertions (i) and (ii). 

To justify assertion (iii), assume that M/Z and L/K are separable. Further- 
more, choose an arbitrary field extension A’/K. Then R = L @x K’ is nonzero 
and reduced. We need as an auxiliary result the fact that the zero ideal in R is 
an intersection of prime ideals. To prove this, consider for an element s 4 0 in R 
the multiplicative system S = {s°,s1,...}. Since R is reduced, S does not con- 
tain 0. Now, proceeding as in the proof of 3.4/6, we can use Zorn’s lemma 14.5 
to construct an ideal p C R that is maximal with respect to the condition that 
pS = 9. It follows that p is a prime ideal. Thus, for each element s 4 0 in R, 
there is a prime ideal p C R such that s ¢ p. Consequently, the zero ideal in R 
is an intersection of prime ideals, say 0 = yer p;. 

Let Q; be the field of fractions of R/p; for all 7 € J. Then the canonical 
homomorphisms R —> R/p;  Q, give rise to an injection R © Hyer Q; and, 
due to the flatness of M/Z, to an injection M@,; RG M®@, Hyer Q,;. Now we 
use the fact that the map 


(*) Mer[[Q—]][(1 829), 7 ® (jer (@ ayes, 
jEed jEed 


is injective, a result that we will prove further below. Since M/Z is separable, 
we know that all tensor products M @; Q, are reduced. Therefore, M @, R is 
reduced as well, and using the isomorphism 


? See Exercise 1 for the fact that M/L is not necessarily separable if M/K is separable. 
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Me,R=M®,(L@x K') 4M @x K' 


of 7.2/7, the same is true for M @x K'. But this asserts that M/K is separable, 
as desired. 

It remains to show that the map (*) above is injective. To achieve this we 
fix an L-vector space basis (y;)ier of M. Since tensor products commute with 
direct sums, see 7.2/4, we can write each element z © M @,; Hyer Q; as a sum 
z= Vier Vi ® (Gj)jer with unique elements qj; € Qj, where almost all terms 
of such a sum are trivial. In particular, only for finitely many indices 2 € I 
can there exist indices 7 € J such that qj; 4 0. In a similar way, we write 
elements in Hye7(!@ ®1 Q;) uniquely as families of type ()U,<7 Yi @ dij) jes. All 
members of such a family are sums admitting only finitely many nonzero terms. 
In particular, for each 7 € J there are at most finitely many indices 7 € I such 
that qj; #40, although all in all, we can have q; 4 0 for infinitely many indices 
i € I and certain indices 7 € J depending on i. Since the map (*) associates to 
an element of type }),<7 Yi ® (dij) jer the element (S7,<; yi ® dij) jes, we see that 
(x) is always injective, but not surjective in general. 


Definition 5. A field extension L/K is called separably generated if there is a 
transcendence basis X of L/K such that L is separable (algebraic) over K(X). 
In this case, ¥ is called a separating transcendence basis of L/K. 


Since every field extension L// admits a transcendence basis, cf. 7.1/3, we 
see for char kK = 0 that L/K is automatically separably generated. Further- 
more, we can directly conclude the following corollary from Proposition 4 (iii) 
in conjunction with Remark 3: 


Corollary 6. Every separably generated field extension L/K, in particular every 
field extension in characteristic 0, is separable. 


Our next objective is to show for finitely generated field extensions a con- 
verse of this assertion. To do this, recall for a field K of characteristic p > 0 
that we can construct the field K?™ of all p'th roots of elements in K. In this 
way, we get a chain of inclusions 


0 1 —2 


K=K? CK” CK? ¢ 


cary 


where KP” = Ue, KP equals the purely inseparable closure of K, a field 
that is perfect and purely inseparable over K; cf. Exercise 6 of Section 3.7. 


Proposition 7. For a field K of characteristic p > 0 and an extension field L 
of K, the following conditions are equivalent: 
(i) L/K is separable. 
(ii) L@x K” ~ is reduced. 
(iii) For every finite extension K'/K such that K' C K?', the tensor product 
L@x K' is reduced. 


304 7. Transcendental Field Extensions 


(iv) If a1,...,a, € L are linearly independent over K, then so are a},...,a®. 
(v) Every subfield L' C L that is finitely generated over K is separably 
generated over K. 
If L/K is finitely generated, say L = K(a1,...,%n), and separable over K, 
then the system of elements x; can be reduced to a separating transcendence 


basis of L/K. 


Proof. The implication (i) => (ii) is trivial. Furthermore, the implication 
(ii) ==> (iii) follows from the flatness of L/K, since every K’ in the situation of 
(iii) is a subfield of K? ~, so that L @x K’ is a subring of L@x K? ™. 

To verify the implication (iii) => (iv), consider elements a,,...,a, € L that 
are linearly independent over K, as well as elements c,,...,c, € K such that 
Sy oi-1 cia? = 0. Then we can extract the pth root eo € K?” ofc, i=1,...,7, 
and consider the field K’ = K(c?',...,') C K?', which is finite over K. 
Now look at the element z = )7j_, a; ® a € L@x kK’. Since 


2 = doa = Sout) @1= (Sad) or =0 
i=l 


i=1 i=1 


and since L @x K' is reduced, we get z = 0. On the other hand, we claim 
that the tensors a; @1,...,a,@1€ L@x K’ are linearly independent over K’. 
Indeed, (@j_, Kaj) ®x K' is a subvector space of L ®@x K’, due to the flatness 
of K’/K, and we have 


(® Kai) Ox K' + D(Ka @x K’) 
i=l 


i=1 


by 7.2/4. Therefore, z = 0 implies that all coefficients c? ~ and hence all cj are 
zero. This shows that a?,..., a? are linearly independent over K’. 

Next assume condition (iv). To derive (v), it is enough to look at the case 
that L/K is a finitely generated field extension, say L = K(21,...,%n). We show 
by induction on n that L/K is separably generated. The base case n = 0 is triv- 
ial. Therefore, let n > 0 and assume that 71,...,2; is a maximal subsystem of 
X1,.--,€n that is algebraically independent over K. Then t < n, and x,..., 2% 
is a transcendence basis of L//. Since nothing has to be shown for t = n, as- 
sume t <n. Furthermore, let f € K[Xi,...,X+41] be a nontrivial polynomial 
of minimal total degree d such that f(21,...,2441) = 0. If f is even a polynomial 
in X7,...,X?,,, then f is of type f = 0, <;(X")” for coefficients c, € K and 
a finite index set J C N‘*!, where we can assume c, # 0 for all vy € I. Hence, the 
pth powers (a')?... (x¢i4')’, v € I, are linearly dependent over A’, and the same 
holds for the monomials x! ...2;{4' by (iv). In this way, we obtain a relation 
g(@41,..-,%t41) = 0 for a nontrivial polynomial g € K[X1,...,Xt41], whose to- 
tal degree is < d. However, this is excluded by the choice of d, and it follows that 
f ¢ K[X?,..., XP]. As a consequence, there is a variable X; such that f is 
not a polynomial in X?. In particular, h = f(a1,..., 0-1, Xi, Vig,---,Tt41) isa 
nontrivial polynomial in X; with coefficients in A[21,...,%i-1, Viti,---, Tesi], 
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admitting x2; as a zero and having a derivative that is not identically zero. 
Therefore, L is algebraic over K(21,...,%j~1,Vis1,---,%t41). In addition, since 
transdeg,(L) = t, we see that 1,...,Uj-1, Viz1,---,Le41 constitute a transcen- 
dence basis of L/K and hence are algebraically independent over kK. Now ob- 
serve that the minimality of the total degree of f implies that h, as a polynomial 
with coefficients in A[21,..., 2-1, Vit1,---,t41], is irreducible and primitive. 
Furthermore, the polynomial ring in X; over this coefficient domain is a unique 
factorization domain by 2.7/3. Therefore, h is prime and hence also prime in 
K(a1,..-,Vi-1, Viga,---,t41) XG]; cf. 2.7/7. Since in addition, the derivative 
of h is nonzero, h is separable by 3.6/1. Therefore, x; is separable algebraic 
over K(21,...,Uj-1, Vis1,---, X41), and in particular, separable algebraic over 
K(a1,.--,j-1, ®i41,---,2n), which in turn, is separably generated over kK by 
the induction hypothesis. Therefore, all in all, Z = K(2),...,2n) is separably 
generated over K’. This finishes the verification of the implication (iv) => (v). 
In particular, our argument shows for L = K(2,...,2n) that the system of the 
x; can be reduced to a separating transcendence basis of L/K. 

Finally, the implication (v) => (i) follows from Proposition 4 (ii) and Corol- 
lary 6. 


If K is a perfect field in the setting of Proposition 7, then K does not admit 


nontrivial purely inseparable field extensions and hence satisfies K = K? ~. 
Therefore, using Corollary 6 we can state the following corollary: 


Corollary 8. Every field extension L/K of a perfect field K is separable. 


Next we want to consider two further classes of field extensions, namely 
primary and regular field extensions, where primary extensions can be viewed 
as a generalization of purely inseparable algebraic extensions; for example, use 
the characterization of primary extensions given in Proposition 13 below. A ring 
R is called irreducible if its radical rad R is a prime ideal. 


Definition 9. A field extension L/K is called primary (resp. regular) if for 
every field extension K'/K the tensor product L ®x K' is irreducible (resp. 
an integral domain).? Thus, an extension L/K is regular if and only if it is 
separable and primary.* 


We can easily read from 7.2/11 that purely inseparable simple field exten- 
sions L/K in characteristic p > 0 are examples of primary extensions. Indeed, 
if L = K(a), and if f = X”” —c € K[X] is the minimal polynomial of a 
over K, we get L @x K' ~ K'[X]/(f). Over an algebraic closure K’ of K’, we 
can write f = (X — a)’, where we have identified a with the corresponding 
zero of f in K’. Then, since rad(K'[X]/(f)) = (X — a)/(f), it follows that 


3 In the literature, a field extension L/K is usually said to be primary if K is separably 
closed in L. This condition is equivalent to the one given here; see Proposition 13. 
+ Use the fact that a ring is an integral domain if its zero ideal is prime. 
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rad(L @x K’) is prime. Since the inclusion K’ > K’ gives rise to an injection 
L@®x K' > L@ x K' by the flatness of L/K, and since rad(L @x K’) is prime 
as we have seen, the same is true for its intersection with [®, K’, which equals 
the radical of L @®x K'. As a consequence, L/K is primary. 

Similarly as we did for separable field extensions, let us derive some elemen- 
tary properties for primary and regular field extensions. 


Remark 10. Every purely transcendental field extension L/K is regular, and 
hence in particular, primary. 


Proof. Use 7.2/12. 


Proposition 11. Let M/K be a field extension. 

(i) If M/K is primary (resp. regular), the extension L/K is primary (resp. 
regular) for every intermediate field L of M/K. 

(ii) M/K is primary (resp. regular) if and only if the extension L/K is 
primary (resp. regular) for all intermediate fields L of M/K that are finitely 
generated over K. 

(iii) If for an intermediate field L of M/K the extensions M/L and L/K 
are primary (resp. regular), the same is true for M/K. 


Proof. It is enough to consider primary extensions. Indeed, recalling the fact 
that a field extension is regular if and only if it is separable and primary, we can 
use Proposition 4 to settle the separable part. If LZ is an intermediate field of 
M/K and K’ an arbitrary extension field of kK, the inclusion L 4 M gives rise 
to an injection L@x K' 9 M® x K", due to the flatness of K’/K. Next observe 
for a ring extension R C R’ and a prime ideal p’ C R’ that the intersection 
Rp’ is a prime ideal in R, and that rad R = RO rad R’. Thereby we see that 
L/K is primary if the same is true for M/K. On the other hand, if L/K is 
primary for every intermediate field L of M/K that is finitely generated over 
Kk, we can read from 7.2/13 that M/K is primary. Thus, assertions (i) and (ii) 
are clear. 

To verify (iii) consider primary extensions M/L and L/K, as well as an 
extension K'/K. Then R = (L @x K’')/rad(L @x K’) is an integral domain, 
and we can consider its field of fractions Q. Now look at the following sequence 
of homomorphisms: 


M @x K' + M ®, (L®x K’) 4 Me, Rk > M@,Q. 


The first map is the isomorphism of 7.2/7, while the two subsequent ones are 
constructed from the canonical maps L@% Kk’ —+ R — Q by taking the tensor 
product with M over L. Using 7.2/5 in conjunction with the flatness of M//L, we 
can identify ker with the tensor product M @ , rad(L @x K"'), which consists 
of nilpotent elements. Furthermore, w is injective. To see that rad(M @, K’) 
is a prime ideal, consider elements a,b € M ®z (L ®x K’) whose product ab is 
nilpotent. Then (wow)(ab) = (Woy)(a)-(woyw)(b) is nilpotent in M@ ; Q. Since 
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M/L is primary, one of the two factors, say (wo y)(a), must be nilpotent. But 
then, since ker wo y = ker y consists of nilpotent elements, a must be nilpotent 
itself. Thus, radi @x% K’) is prime. 


In the following we want to show that a field extension L/K is already 
primary, resp. regular, if the tensor product L @, K’ is irreducible, resp. an 
integral domain, for all algebraic field extensions Kk'/K. To prove this, we need 
the following key ingredient: 


Lemma 12. A tensor product A @x A’ of two algebras A and A’ over an 
algebraically closed field K is an integral domain if and only if A and A’ are 
integral domains. 


Proof. First assume that A®x A’ is an integral domain. Then A@ x A’ is nonzero, 
and the same is true for A and A’. Therefore, the structural maps kK —> A, 
K —+ A’ are injective. Due to the flatness of A and A’ over K, the tensor 
product maps 


Ar A@®xK K SASK A, A~K@x A SO AQKA 


are injective as well, and we see that A and A’ are integral domains. 

To prove the converse, we apply the geometric methods of Section 3.9. In 
particular, we use Hilbert’s Nullstellensatz 3.9/4. Therefore, assume that A and 
A’ are integral domains. Relying on 7.2/13, we may assume that A and A’ are 
finitely generated K-algebras, say of type 


A=K(X]/p, A E[Y]/q, 


for variables X = (X1,...,X,), ¥Y = (Y%1,..., Ys) and prime ideals p, q. Further- 
more, we know from 7.2/10 that there is a canonical isomorphism 


(K[X]/p) @x (K[Y]/q) > KLX,Y]/(p.q), Fez Fo. 


Now let U = V(p) C A” and U' = V(q) C K® be the algebraic subsets of A’, 
resp. K*, that correspond to p and q. Then U x U’ = V(p,q), which means that 
UxU' coincides with the algebraic set given by the ideal (p,q) C A LX, Y]. Since 
all polynomials in p vanish on U, the substitution homomorphism K |X] — K, 
f +> f(x), factors for « € U through A ~ K[X]/p and hence yields a sub- 
stitution homomorphism A —> K. In particular, as indicated at the end of 
Section 3.9, we can view the elements of A as “functions” on U. Furthermore, 
due to Hilbert’s Nullstellensatz 3.9/4, a function f € A vanishes on all of U 
if and only if f € radA, i-e., if and only if f is nilpotent. However, in our 
case, p is a prime ideal, and hence A = AK [X]/p is an integral domain, so that 
f(U) = 0 is equivalent to f = 0. In a similar way we consider the elements of A’ 
as functions on U’, as well as the elements of A @x A’ as functions on U x U’. 

In a first step we want to show that A®@ x A’ is reduced, which means for an 
element g € A @ x A’ that g(U x U’) = 0 implies g = 0. To do this we need to 


308 7. Transcendental Field Extensions 


consider for points « € U the tensor product of the substitution homomorphism 
A— K, a+ > a(z), with A’, ie., the map 


Or: A@K A A, So ai Ba, S/ ai(x) - al, 
as well as later on the analogous map 
yi A@xA A, Soa @ar->\ a- ally), 


for points y € U’. Choosing a K-basis (e/);c; of A’, we can apply 7.2/4 and 
write every g € A@x A’ asasum g = )0-, 9 Se; with unique elements g; € A. 
Now, after these preparations, look at a nilpotent element g € A ®x A’, say 
9 = Nicer H © &%. Then g vanishes on U x U'. Hence, the same is true for the 
functions o2(g) = )lje7 9i(@) - &; on U', where x € U. Since A’ is reduced, we 
get g;(z) = 0 for all « € U. Furthermore, since A is reduced as well, this means 
that g; = 0 for all i € J, and hence g = 0. Therefore, A ®x A’ is reduced. 

In a similar way we can show that A @,x A’ is even an integral domain. 
Indeed, look at elements f,g € A@x A’, f £0, such that f-g = 0, and write 
9 = Vier HB &% with unique elements g; € A again. Since 


Gal f)* S- 9:(2) : e; = 02(f) + ox(9) = o2(f9) = 0, 


iel 


and since A’ is an integral domain, we get o,(g) = 0, and hence g;(2) = 0 for 
all i € I at those points x € U where o,(f) 4 0, i-e., at all points x € U such 
that there exists y € U’ satisfying f(x,y) 4 0. Therefore, f - (g; @ 1) vanishes 
for alli € J on U x U’, thus implying f - (g; ® 1) = 0, as we have seen before. 
Furthermore, we look at the equation 


Ty(T)* ge = %(F ~(g:® 1)) =0 


for y € U’. Since f 4 0, there exist points (x,y) € U x U’ such that f(x,y) 40 
and hence such that 7,(f) 4 0. However, since A is an integral domain, this 
implies g; = 0 for all i € J, and hence g = 0. 


Proposition 13. The following conditions are equivalent for a field extension 
L/K: 
(i) L/K is primary. 
(ii) L @x K’ is irreducible for every finite separable extension K'/K. 
(iii) K ts separably closed in L, i.e., every element a € L that is separable 
algebraic over K is already contained in K. 


Proof. The implication (i) => (ii) is trivial. Therefore, assume condition (ii) and 
let a € L be separable algebraic over K,, with corresponding minimal polynomial 
f € K[X] of a over K. This polynomial factorizes over L into a product of 
irreducible factors, say f = f,...f,, and there cannot be multiple prime factors, 
since a, and hence f, are separable. Writing kK’ = K(a), we get 
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Lex K'~][L[X1/(fi), 


i=1 


from 7.2/11, thus implying that L @x K’ is a finite product of fields. In partic- 
ular, the radical rad(L ®x K") is zero. On the other hand, this radical is prime 
by our assumption. Therefore, we must have r = 1, and it follows that f is 
irreducible in L[X]. Since a € L is a zero of f by the definition of f, there is a 
factorization of type f = (X — a)-g in L[X]. But f being irreducible implies 
g = 1, and therefore a € K. This shows that K is separably closed in L. 

Now assume condition (iii). To show that L @x K’ is irreducible for arbi- 
trary field extensions A’/K, let us first consider the case that A’/K is a finite 
separable extension. Then A’/K is a simple extension by the primitive element 
theorem 3.6/12, say K’ = K(a). Let f € K[X] be the minimal polynomial of 
a over K. It is irreducible over K, but also over L. Indeed, if f= g-hisa 
factorization into monic polynomials g,h € L|X], then the coefficients of g and 
h are separable algebraic over K, since they belong to the splitting field of f 
over KX. Since K is separably closed in L, we get g,h € K[X]. Furthermore, the 
irreducibility of f over kK implies g = 1 or h = 1 and hence that f is irreducible 
over L. But then, using the isomorphism L @x« K' ~ L[X]/(f) of 7.2/11, we 
see that L @x K’ is a field. 

In a next step, we consider a finite separable extension K’/K as before, 
and a finite purely inseparable field extension K"/K’, assuming that we are 
in positive characteristic. Immediately following Definition 9 we showed that 
purely inseparable simple extensions are primary. Therefore, we can conclude 
from Proposition 11 (iii) that K”/K’ is primary as well. Thus, the tensor prod- 
uct L@x K" ~ (L@x K") x K" is irreducible, and we see that L @x% K” is 
irreducible for all finite extensions K”/K. But then, if K is an algebraic closure 
of K, it follows that L @x K is irreducible as well. Indeed, using 7.2/13, we can 
interpret the radical rad(L @« K) as the union of all radicals rad(L @x K"), 
where K”/K varies over the finite extensions that are contained in K. 

Using Lemma 12, it is now easy to see that L @x K' is irreducible for 
arbitrary extensions K'/K, and therefore that L/K is primary. Indeed, choose 
an algebraic closure K’ of K’ and consider the injection L @« K’ @ L @x K' 
induced from K’ <+ K’. It is enough to show that L @x K’ is irreducible. We 
have just seen that L @x K is irreducible if K is the algebraic closure of K in 
Kk’. Since the tensor product 


((L Ok K)/rad(L Ok k)) Qz k' 


is an integral domain by Lemma 12, it is seen similarly as in the proof of 
Proposition 11 (iii) that L @x K’ is irreducible. 


Combining the characterizations we have proved for separable and for pri- 
mary field extensions, we can derive a corresponding characterization for regular 
field extensions. 
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Proposition 14. The following conditions are equivalent for a field extension 
L/K: 
(i) L/K is regular. 
(ii) L @x% K’ is an integral domain for every finite extension K'/K. 
(iii) L/K is separable and K is algebraically closed in L. 


Proof. A ring R is an integral domain if and only if the zero ideal 0 C R is 
prime. This is equivalent to the fact that the radical rad R is both prime and 
zero. This justifies the equivalence of (i) and (ii) if we use Propositions 7 and 13. 

To derive the equivalence of (i) and (iii), start with a regular extension 
L/K. Then the algebraic closure of kK in L is regular over K as well, due to 
Proposition 11 (i). Therefore, it is enough to consider the case that L/K is 
algebraic. But in this case, Remark 1 and Proposition 13 imply immediately 
L = K and therefore (iii). On the other hand, if condition (iii) is given, we can 
deduce (i) from Proposition 13 again. 


Finally, let us indicate a geometric application of the results we have ob- 
tained in the present section. Let K be a field and K an algebraic closure. 
Working in the setting of Section 3.9, consider an algebraic subset U C K” 
of K” that is defined over K and irreducible over K, in the sense that the 
corresponding ideal p = In (U) C K[X1,...,Xn] is prime; see also the geo- 
metric interpretation of irreducibility in Exercise 4 of Section 3.9. Then we can 
view U just as well as an algebraic subset of K” that is defined over K. Let 
Ix(U) be its corresponding ideal in K[X1,...,X,], which by Hilbert’s Null- 
stellensatz 3.9/4 satisfies Ee(U) = rad(pK[X1,...,Xn]). The algebraic set U 
is called geometrically reduced if Ize(U) = pK [X,,...,Xn], ie., if the ideal 
pA [X1,...,X;,] is reduced. Furthermore, U is called geometrically irreducible 
if -(U) =rad(pK[X1,...,X,]) is prime, hence if U is irreducible as an alge- 
braic set. defined over K. In Exercise 4 below we show that U is geometrically 
reduced (resp. geometrically irreducible, resp. geometrically reduced and ge- 
ometrically irreducible) if and only if, writing Q for the field of fractions of 
K[X1,...,Xn]/p, the extension Q/K is separable (resp. primary, resp. regu- 
lar). 


Exercises 


1. Consider field extensions K C L C M, where M/K is separable (resp. primary, 
resp. regular). Then the extension L/K is separable (resp. primary, resp. regular), 
as we have seen. Is a corresponding assertion also valid for the extension M/L? 


2. A field extensionL/K is primary if and only if K is separably closed in L. Is it 
possible to characterize separable extensions L/K for p = char K > 0 in a similar 
way, for ecample by requiring that a? € K for a € L implies a € K or that the 
algebraic closure of K in L is separable over K? 


3. Give an example of a separable field extension that is not separably generated. 
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4. Let K[X] be the polynomial ring over a field AK in finitely many variables 
Xy,...,X,. Furthermore, consider a prime ideal p C K[X], the correspond- 
ing field of fractions Q = Q(K[X]/p), as well as an algebraic closure K of K. 
Show: 

(i) The extension Q/K is separable if and only if the ideal pk [X] is reduced 
in K[X]. 
(ii) The extension Q/K is primary if and only if the ideal rad(pK [X]) is prime 
in K[X]. 
(iii) The extension Q/K is regular if and only if the ideal pk [X] is prime in 
K[X]. 

5. For a field K and an algebraic closure K show that an extension L/K is regular 
if and only if L @x K is a field. 

6. For a perfect field K, consider two reduced K-algebras A, A’. Show that their 
tensor product A ®x A’ is reduced as well. 

7. Let K be a field of characteristic p > 0. A system x = (21,...,%p) of elements 
in KP’ is called p-free over K if the extension K (a)/K cannot be generated by 
fewer than n elements. Show: 

(i) n elements 21,...,%n € KP" are p-free over K if and only if the canonical 
map K[Xq,...,Xn]/(X7? — 2,...,Xk — 2h) —>+ K(2) is an isomorphism. 
(ii) A field extension L/K is separable if and only if the following condition 


holds: If 21,...,%, € K are p-free over K”, then these elements are p-free 
over L? as well. 


7.4 Differential Calculus* 


The objective of the present section is to characterize separable field extensions 
in terms of differential calculus. However, the methods we will use are not 
based on infinitesimal limits as is customary in analysis. In fact, they are of 
purely algebraic nature and find their natural continuation in the study of so- 
called étale and smooth morphisms in algebraic geometry; see, for example, [3], 
Chap. 8. In the following, let R always be a ring. 


Definition 1. An R-derivation of an R-algebra A to an A-module M is given 
by an R-linear map 6: A —> M satisfying the “product rule” 


Without further specification, a derivation is meant as a Z-derivation. 


For elements r € R we have always 6(r- 1) = 0. Furthermore, from the 
product rule one easily derives the “quotient rule” 


5(£) - —_— 
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for elements f,g € A, where g is a unit. The set of all R-derivations 6: A —> M 
yields an A-module, which is denoted by Derr(A, M), or by Der(A, M) if R = Z. 
For example, if A = R[X] is the polynomial ring in one variable over R, then 
the usual formalism of differentiating polynomials 


d 
sy RIX] RIX], F(X) F(X), 


gives rise to a well-defined R-derivation of RLX’] to itself. Since an R-derivation 
6: RLX] —> R[X] is already uniquely determined by the element 5(X) due 
to the product rule, we see that Derg(RLX], RLX]) is a free R[X]-module, 
generated by the derivation aia 
Proposition 2. Let A be an R-algebra. Then there exists an A-module Vue 
together with an R-derivation dajp: A — rr such that (Qayrs da/r) admits 
the following universal property: 

For every R-derivation 6: A —> M to an A-module M, there is a unique 
A-linear map (p: VR —> M satisfying 0 = po da;r, in other words, making 
the diagram 


A—- ans ur 
5 | “e 
yp 
M 


commutative. The pair (Qa yps da/p) is uniquely determined by this property, up 
to canonical isomorphism. Furthermore, (Qayp: dar): or just Vryrs is referred 
to as the module of relative differential forms (of degree 1) of A over R. 


Proof. We start with the case A = R[X], for a system X¥ of (arbitrarily many) 
variables X;, i € I. Then define (1, i = A as the free A-module generated 
by I. Writing dX; for the basis element of (2), IR that is given by 7 € J, we get 
ta /R = @i-, 4: dX;. Now, taking the partial derivatives of elements f € A 
with respect to the X; in a formal sense, it is easily seen that 


dajp: A—+ Uy, prod stax, 


ie. 


is an R-derivation satisfying d4/;p(X;) = dX;, and further, that (Qh yp dayr) 
satisfies the universal property of a module of relative differential forms of A 
over R. Indeed, if 6: A —> M is an R-derivation to an arbitrary A-module M, 
define an A-linear map y: Cur — M by y(dX;) = 6(X;) for i € I. Then 
y oda/r is an R-derivation from A to M that coincides with 6 on the variables 
X;,7€ I. Using the A-linearity and the product rule, we get 


Of 
+X; axX,) 


(dX;) = po dar(f) 
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for all f € A, thus implying 6 = yo da;r. In particular, since y(dX;) = 6(X;) 
by the latter relation, y is unique. 

In the general case we may assume A to be of type R[X]/a, for a system 
of variables X and an ideal a C R[X]. Therefore, it is enough to prove the 
following lemma: 


Lemma 3. For an R-algebra A and an ideal a C A, write B = A/a. Further- 
more, let (Qayrs dajr) be the module of relative differential forms of A over R. 
Then 


2 = Ujp/(aQhp + Adajr(a)), 


together with the R-linear map d: B —+ Q induced from dajp: A — Va» is 
the module of relative differential forms of B over R. 


Proof. First observe that Q is indeed a B-module. Further, since d4/pz admits the 
properties of an R-derivation, the same is true for d. To establish the universal 
property for d, consider an R-derivation 6: B —+ M to a B-module M. Then 
the composition 6 = 607 of 6 with the projection 7: A —> A/a = B is an 
R-derivation from A to M, when viewing M as an A-module. Furthermore, the 
universal property of d4/z: A — Q), jp ensures that 6 factors uniquely through 
an A-linear map y: Vie —> M. Since we have 6(a) = 0 and since M is a 
module over B, we see that necessarily, p(aQ, jr + Adajr(a)) = 0. Therefore, y 
gives rise to a B-linear map @: Q —> M satisfying 6 = God. That G is uniquely 
determined by this equation follows from the uniqueness of y. This settles the 
proof of Lemma 3 and thereby also the proof of Proposition 2. 


In particular, the preceding proposition says that the map y+—+ yo da;r 
gives rise to an A-linear bijection 


Hom 4 (Qype, M) — Derr(A, M) 


between A-module homomorphisms 2), os M and R-derivations from A to 
M. Furthermore, we conclude from its universal property that oF /R is generated 
by all differentials of elements in A, i.e., by all elements of type dayr(f), f € A. 
In more precise terms, the argument given in the proof of Proposition 2 yields 
the following: 


Proposition 4. Let A be an R-algebra and x = (x;)ier a system of elements 
in A such that A= R[x]. Then: 

(i) (dayr(ai))ier 18 an A-generating system of QR: 

(ii) If x = (x;)ier is algebraically independent over R, then (da/r(#i))ier ts 
a basis of Uri in particular, Qasr is then free. 


Next we want to show that every homomorphism of R-algebras 7: A —> B 
gives rise to a canonical exact sequence of B-modules 
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W284 B + bp ae oy 0. 


To define the map a, look at the composition of 7: A —> B with the 
R-derivation dgjp: B —> Q, IR: Viewing 2% jp as an A-module under T, we 
see that dg;poT is an R-derivation of A. By the definition of OF /R? this deriva- 
tion factors through an A-linear map 


ie — Vr: dajr(f) + dayr(r(f)), 
which, in turn, gives rise to a B-linear map 
Q: Qn Ba B— OBR; dajr(f) ® b+ b- dgyr(t(f)).- 


Finally, to set up 8, observe that every A-derivation of B can be viewed as an 
R-derivation of B. Thus, by the universal property of Q} /R there is a well- 
defined map 


B: OBR —T Qa, dp/r(g) > dpya(g)- 


Proposition 5. For every homomorphism of R-algebras 7: A —> B, the se- 
quence 


O12 @aB + Mig —> 04,4 —> 0 
given by dajr(f) @ b+ b- dpyr(t(f)), dayr(g) > dpya(g) is exact. 


Proof. Since QA is generated by the elements of type dg/a(g), g € B, and 
since B(dg/r(g)) = deja(g), we see that 6 is surjective. Furthermore, we have 
8B oa=O0, which implies ima C ker 8. To prove ima = ker (, it is enough to 
show that Q%/p/ ima together with the map d: B —> Q3/,/ ima induced by 
dg/r admits the properties of the module of relative differential forms of B over 
A. To justify this, consider the following commutative diagram: 


d 1 
A SE M284 B 
-B Be 


d QR = —> OB R/ima 

First, d: B—> Q5 IR /ima is an A-derivation, since it is an R-derivation by its 
definition and since we have dg/r(t(f)) € ima for all f € A. Now ifd: B —> M 
is an A-derivation from B to a B-module M, then it is an R-derivation as well. 
Thus, there exists a unique B-linear map y: 0} ns M such that 6 = yodg;r. 
Since 6 is an A-derivation, we get 607 = 0 and hence yoa = 0. However, this 
means that y factors through a B-linear map P: QB /R/ ima —> M. We have 
6 = God by construction, and @ is uniquely determined by this equation. 


Let us evaluate the exact sequence of Proposition 5 in a special case. 
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Proposition 6. For an R-algebra A and a multiplicative system S C A, con- 
sider the canonical map T: A —> Ag from A to its localization by S. Then the 
corresponding map 


a: Up ®@4As 4%. pn, — dajr(f) @a- a- dagr(r(f)), 


is byective. In particular, we get ag /A = 0. 


Proof. The relation ng ja = O18 easily derived from the bijectivity of a; use 
Proposition 5, or look at the case R = A and use /,,, = 0. Therefore, it 
remains only to show that a is bijective. To do this we identify 0), [R@A Ag 
with the As-module (Q4,)5, see 7.2/8, and show that (Q4),)5 together with 
the map 
i: Ag > (Oya) : = es 

admits the universal property of the module of relative differential forms of Ag 
over R. First, we have to justify that d is well defined. Indeed, if £ = £ for 
elements f, f’ € A and s,s’ € S, then there is an element s” € S such that 
s"(s' f — sf’) = 0 holds in A. This implies 


(s'f — sf’): dajr(s") + 8" dajr(s'f — sf’) =0 
and we see, multiplying by s”, that d4/p(s'f — sf") vanishes as an element of 
(Qyr)s- In other words, we have s’d(f) — sd(f’) = f’d(s) — fd(s‘), writing 6 
for the composition of d4/pz with the canonical map Vie -> (Qe) s- That 


the map d: Ags —> (0, jr)s 8 well defined follows then from the following 
computation: 


s(sd(f) — f5(s)) — s°(s'6(f") — f'5(s')) 
= ss'(s'5(f) — o — sfd(s)+s . oe 
= ss'(f’5(s) — fay s” f5(s) + 8? f'd(s') 
= s'(ef'— s'f)i(6) + a(of" — 5'F)5(6" 
=0. 


One can check now in a straightforward manner that d is a derivation. How- 
ever, we will skip this step. Finally, to test the universal property, consider an 
R-derivation 6: Ag —> M to an As-module M. Then 607 is an R-derivation 
from A to M, and there exists an A-linear map y: 2), IR? M such that 
d60T = yoda;r. Using Ag-linear extension, y gives rise to an Ag-linear map 

3: (Qyp)s —> M satisfying 6 = ys od, and it is easily seen that yg is 
oe determined by this equation. 


In the following we want to apply the theory of differential forms to field 
extensions. Looking at some special cases, we will determine the module of 
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relative differential forms Q], jx that is attached to a field extension L/K. In 
principle, this can be done using Lemma 3 and Proposition 6. However, from 
a technical point of view, it is often easier to determine instead of 07 L/K its 
dual space, namely the L-vector space Derg (L, L) ~ Hom (Q5,4¢5 L). There is 
a canonical injection 


Qn K <> Hom, (Derx(L, L), L), drjx(z) + (5-4 d(2)), 


in terms of L-vector spaces, which is bijective if one of the spaces Qi) /K and 
Derx(L, L) is of finite dimension over L. 


Proposition 7. Let L/K be a field extension and x = (xj) jes @ system of gener- 
ators. Choosing a system of variables X = (Xj) jes, look at the K-homomorphism 
am: K(X] —> L given by X; > a;, and let (fi)ier be a system of generators of 
the ideal ker 7. Furthermore, consider a derivation 6: kK —+ V to an L-vector 
space V, as well as a system (v;)je of elements in V. Then the following con- 
ditions are equivalent: 

(i) 6 extends to a derivation 6’: L —>+ V satisfying 6'(x;) = v; for all j € J. 

(ii) The relations 


are valid in V, where f? for f € K[X] indicates the “polynomial” that is 
obtained in V[X] := V @« K[X] by applying 6 to the coefficients of f, i.e., 
=>, d(e,)X” for f = Yo, eX” 


If there exists an extension as in (i), it is unique. 


Proof. Assuming condition (i), we obtain for polynomials f = >>, c,.X” € K[X] 
the equation 


ale) or Le =F 2) + DEE yy, 


ie., 6’, as an extension of J, is uniquely determined on A’[x] by the equations 
6'(a;) = v;, 9 € J. Furthermore, using the quotient rule 


a(2) _ bd'(a) — ad’(b) 

b b? 

for elements a,b € K [x], b 4 0, the uniqueness assertion of 6’ extends to K(x); 
alternatively, one can use Proposition 6. Also we see that the equations of (ii) 


are valid because f;(a) vanishes for all 7 € J. ; 
Now assume condition (ii) and define a derivation 6: A[X] —>+ V by 


+r te», 


jeJ 
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Viewing V as a K[X]-module under the map 7: K[X] —> L, it is easily 
verified that we indeed get a derivation. It satisfies 6(f;) = 0 for all i € I, due 
to the equations in (ii). Furthermore, the product rule implies 6(gf;) = 0 for 
arbitrary elements g € K[X]. In this way, 6 vanishes on the ideal generated by 
(fi)ier in K[X] and hence on the kernel of the map 7: K[X] —> L, ¥ +> z. 
Therefore, 5 induces a derivation 6: K [x] —> V extending 6. Finally, we can 
use the quotient rule or Proposition 6 in order to extend 6 to a derivation 
': K(z) 3 V. 


The result of Proposition 7, which we have just proved, provides a use- 
ful tool for computing 07 jx and Derg (L,L) for field extensions L/K’, since 
K-derivations L —+ L may be viewed as extensions of the trivial deriva- 
tion kK —> L. However, in most cases one will subdivide an extension L/K 
by suitable intermediate fields, say K C L’ C L, and start by examin- 
ing the K-derivations of L’. Subsequently one needs to know how to extend 
K-derivations on L' to K-derivations on L so that after all, one obtains in- 
formation on the K-derivations of L. This is a typical scenario for applying 
Proposition 7. Alternatively, one can use the exact sequence of Proposition 5 
for a chain of field extensions K Cc L’ Cc L. Here one would like the map 
Qa: Qik Sp LS Qh K to be injective, which, however, is not the case in 
general. One can show that the injectivity of the map a is equivalent to the con- 
dition that every K-derivation L' —> L admits an extension to a K-derivation 
L —> L; cf. Exercise 3. 


Next we want to rephrase the assertion of Proposition 7 to yield assertions 
on modules of differential forms. 


Corollary 8. Let L/K be a purely transcendental field extension that is gen- 
erated by a transcendence basis (x;)jes. Then (drx(aj))jer 18 a basis of the 
L-vector space Qh KR: 


Proof. Use Propositions 4 and 6. Alternatively, one can rely on the assertion of 
Proposition 7, at least if the transcendence basis (2;) je, is finite. 


Corollary 9. Let L/K be a separable algebraic field extension. Then, for every 
derivation 6: K —+V to an L-vector space V, there exists a unique extension 
as a derivation 6': L —+ V. In particular, we get Qo K =0. 


Proof. Let 6: K —>+ V be a derivation to an L-vector space V, and let L’ be an 
intermediate field of L/K such that L’/K is finite. As we know, L’/K is simple 
by the primitive element theorem 3.6/12, say L’ = K(x) for some x € L. Let 
f € K[X] be the minimal polynomial of « over K. Then, given v € V, the 
condition of Proposition 7 for extending 6 to a derivation 6’: K(x) —+ V such 
that 6’(x) =v reads 


f(a) + f'(w)-v =0. 
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Since f is separable, the derivative f’ of f is a nonzero polynomial. Furthermore, 
we have f’(x) #4 0, since the degree of f’ is smaller than the degree of the 
minimal polynomial f of x. Therefore, v is uniquely determined by the preceding 
equation, and we see that 6 extends uniquely to a derivation 0’: L’ —> V. 

From this we can easily conclude that 6 extends uniquely to a derivation 
0’: L —+ V. Indeed, for every intermediate field L’ of L/K that is finite over 
K, we can extend 6 to a derivation 6’: L’ —> V, as just seen. Since every 
such extension is uniquely determined by 6 and since L can be exhausted by 
subfields of type L’, it follows that there is a unique extension of 6 to a derivation 
L— V. 

In particular, the trivial derivation K —> L extends to the trivial derivation 
L —+ L only, which implies Derx (LZ, L) = 0 and hence OG K = 0. 


In contrast to the assertion of the preceding corollary, the extension of 
derivations, for example in the setting of Proposition 7, poses problems if the 
extension L/ IK is not separable. 


Corollary 10. Let K be a field of characteristic p > 0 and L/K a purely 
inseparable field extension of degree p, say L = K(x) with minimal polynomial 
of x over K given by f = X? —c € K[X]. Furthermore, let 6: K —> V be a 
derivation to an L-vector space V. Then: 

(i) d(c) = 0 if there exists a derivation 0’: L —> V extending 0. 

(ii) Conversely, if 6(c) = 0, there exists for every v € V a unique derivation 
6’: L —+ V that extends 6 and satisfies 6'(a) = v. In particular, dr/K(x) is an 
L-basis of Qo KR: 


Proof. According to Proposition 7, the derivation 6 admits an extension to a 
derivation 6’: L —>+ V satisfying 6/(a) = v if and only if the equation 


—d(c) + pa2?!-v=0 


is satisfied, hence if and only if 6(c) = 0, where in the latter case, the value of 
6'(x) = v can be selected without any restriction. Therefore, Derx(L, L) is of 
dimension 1 over L, and the same is true for 0}. Furthermore, dz x(x) is a 
basis of QF j4¢- 


Now we are able to derive the desired characterization of separable field 
extensions, at least in the case of finitely generated extensions. 


Theorem 11. Consider a field extension L/K that is generated by r elements, 
say L = K(y,..-, Yr). Then 


transdegy L < dim, Qy/% <r. 


Furthermore, transdeg, L = dimy Qu is equivalent to the fact that L/K is 
separable. 
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Corollary 12. A finitely generated field extension L/K is separable algebraic 
if and only if Qo Ke = 0. 


Corollary 13. Let L/K be a separable finitely generated field extension. Then 
the following conditions are equivalent for elements x1,...,%m © L: 

(i) @1,.-.,%, form a separating transcendence basis of L/K. 

(ii) dryx(@1),---,dz/K(&n) form an L-basis of O7 pK: 


The assertion of Corollary 12 is a special case of Theorem 11. Therefore, it 
has been called a “corollary.” However, from a proof-theoretic point of view, it 
serves as a preparatory lemma that will be used during the proof of Theorem 11. 

Accordingly, we start with the proof of Corollary 12. If L/K is separable al- 
gebraic, we have Q) jn 0 by Corollary 9. Conversely, assume Q a = 0, which 
is equivalent to Derg (L, L) = 0. Choosing a transcendence basis 21,...,2p of 
L/K, the field L is a finite algebraic extension of K(21,...,2%n). If this exten- 
sion is even separable, we read from Corollaries 8 and 9 that Derx(L, L) is of 
dimension n over L. Therefore, n = 0, and L/K is separable algebraic. 

On the other hand, if the extension A’(21,...,%,) C L is not separable 
for p = char K > 0, then there is an intermediate field L’ of L/K such that 
L/L’ is purely inseparable of degree p. Furthermore, there exists a nontrivial 
L'-derivation EL —+ L by Corollary 10, and hence in particular a nontrivial 
K-derivation L —+ L. However, this is in contradiction to Derg (L, L) = 0, 
so that LZ must be separable over K(21,...,2,). This settles the assertion of 
Corollary 12. 


Now we turn to the proof of Theorem 11. It follows from Propositions 4 and 
6 that Or Ke is generated by the elements dz/x«(y1),.--,dz/K(Yr). Therefore, 
we get dim; Q8 Ke < _r. Now choose elements 21,...,%, € LD such that the 
differential forms dzp/x«(«1),.--,dz/K(%n) yield a basis of Qh: Furthermore, 
let L' = K(a1,...,2n). Then, looking at the exact sequence 


yj Oy L 4 OU), + %,, —> 0 


of Proposition 5, the map a is surjective, which shows that 07, jn 0. Therefore, 
as we have seen, the extension L/L’ is separable algebraic, and we get 


transdeg, L = transdeg,,L' < n = dim, Qh /K- 


If the above inequality is an equality, then 21,...,2, are algebraically indepen- 
dent over K, and we see that the extension L/K is separably generated and in 
particular separable; cf. 7.3/6. Conversely, if L/K is a finitely generated separa- 
ble field extension of transcendence degree n, then L/K is separably generated 
by 7.3/7. Furthermore, Corollaries 8 and 9 show that Derx(Z,Z) and hence 


ee JK are of dimension n over L. 


Finally, to achieve the proof of Corollary 13, let L’ = K(a1,...,%,) and 
consider the exact sequence 
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Wye Ory L 4+ Oh, 4 by, —> 0 

of Proposition 5. Furthermore, assume that 71,...,%, form a separating tran- 
scendence basis of L/A. Then we conclude that (7 jy = 9 from Corollary 12 
or Corollary 9, and the map a is surjective. But it is even bijective, since 
dim; (Q7,)~ @1 L) = n by Corollary 8, as well as dimQ;),~ = n by Theo- 
rem 11. Since dyr/q(#1),..-,dp//K(%n) form a basis in QuiK the same is true 
for the images in QF /K> Since a: is bijective. 

Conversely, if dz«(x1),...,dz/K(@n) form a basis of QR we conclude as 


in the proof of Theorem 11 that x1,...,%, are a separating transcendence basis 
of L/K. 


The assertion of Corollary 13 shows again, using Proposition 4 in conjunc- 
tion with Proposition 6, that for a separable finitely generated field extension 
L/K, every generating system can be reduced to a separating transcendence 
basis. 


Exercises 


1. Is the condition Qh = 0 for an arbitrary field extension L/K equivalent to the 


fact that L/K is separable algebraic? 


2. Consider a field extension L/K in characteristic 0. Show that every derivation 
K —+ V to an L-vector space V extends to a derivation L —> V. 


3. For field extensions R C K C L, consider the map 
a: Up @x LOR, — dxjr(@) @a-> a- dy /pR(z). 


Show that a is injective if and only if every R-derivation K —+ L extends to an 
R-derivation L — L. 


4. Let L/K bea finitely generated field extension, say LD = K(x1,...,Up). Assume 
that the kernel of the K-homomorphism K[X1,...,X,] —> L mapping X; to 
x; for all i is generated by polynomials f),...,f- € K[X1,...,Xn] that satisfy 


the condition 


Show that L/K is a separable extension of transcendence degree n — r. 


5. Let L/K be a field extension in characteristic p > 0 such that L? Cc K. Fur- 
thermore, let (x;)iex be a p-basis of L/K, i.e., a p-free system (cf. Exercise 7 of 
Section 7.3) generating the extension L/K, and let 6: K —+ V be a derivation 
into an L-vector space V. Show for c; = x?: 

(i) If there exists a derivation 6’: L —> V extending 6, then 6(c;) = 0 for all 
aed. 
(ii) Conversely, if 6(c¢;) = 0 for alli € J, then for every system (v;)ie7 of elements 
in V, there is a unique extension 6’: L —+ V of 6 satisfying 6’(2;) = v; for 
all i. 
(iii) The differential forms dz/« (ai), 1 € J, constitute an L-basis of Ob Ke: 
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6. Show that a field extension L/K is separable if and only if every derivation 
kK —+ L extends to a derivation L —> L. Hint: Use Exercise 2, as well as 
Exercise 5 for characteristic p > 0 in conjunction with the characterization of 
separable extensions in Exercise 7 of Section 7.3. 


Appendix 


Solutions to Exercises 


Each section of the book ends with a list of specially adapted exercises, some 
of them printed in italics. The latter are particularly suited for digesting the 
material presented in the corresponding sections. Different from the remaining 
exercises of more classical type, they concern problems that could just as well 
be addressed in conversational discussions. Only for these exercises do we give 
explanations and hints for their solutions below. 


1.1, Exercise 1. Clearly, conditions (ii) and (iii) of 1.1/1 imply conditions (ii’) 
and (iii’) of 1.1/2. Conversely, let G be a set with an associative law of com- 
position such that there is a left neutral element e € G, as well as for every 
element a € G a left inverse 6 € G. We show first that b is also a right inverse of 
a. Indeed, let ba = e. Then b admits a left inverse c such that cb = e. However, 
this implies 

ab = eab = cbab = ch =e. 


Hence, if b is a left inverse of a, it is a right inverse of a as well, so that 
condition 1.1/1(iii) is met. It remains to show that the left neutral element 
e € Gis right neutral as well. Let a € G. If b € Gis a left inverse of a, it is also 
a right inverse of a, as we have seen, and we get 


ae = aba = ea =a, 


justifying condition 1.1/1(ii). 


1.1, Exercise 2. We show that there cannot exist a group isomorphism between 
Q and Qyo, due to incompatible properties of the two groups. Given x € Q, 
there is always a rational number y € Q such that « = y+y, namely y = $0. 
However, the corresponding assertion, that for every x € Qs there is an element 
y € Qso such that « = y- y, is false. Indeed, for x = 2 there does not exist, 
as one knows, a rational number y whose square equals 2. This is proved by 
relying on the unique prime factorization of integers. Now, if there exists an 
isomorphism of groups y: Q —> Qso, the surjectivity of vy shows that there 
exists an element a € Q satisfying y(a) = 2. But then, writing b = $a, we would 
get y(b)? = y(2b) = y(a) = 2, contradicting the fact that 2 does not admit a 
rational square root. 
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1.2, Exercise 1. Since H is of index 2 in G, we see that G is the union of two 
disjoint left cosets of H, namely of H itself and of another one denoted by H’ 
that necessarily coincides with the complement of H in G. The same argument 
works for right cosets of H as well, and it follows that H’ is both a left and a 
right coset of H. Now let a € G. If a € H, we get aH = Ha for trivial reasons. 
On the other hand, if a ¢ H, then the two cosets aH and Ha are both different 
from H and hence must coincide with H’. Therefore, also in this case we get 
aH = Ha, so that A is normal in G. 

To see that a subgroup of index 3 is not necessarily normal, let us look at 
the symmetric group G3. Let a € G3 be the permutation that interchanges the 
numbers 1 and 2, leaving 3 fixed. Then H := {id,o} C G3 is a subgroup of 
order 2 and hence a subgroup of index 3 by the theorem of Lagrange 1.2/3, 
since ord G3 = 6. Now let 7 € G3 be the permutation that leaves 1 fixed and 
interchanges 2 and 3. Then Toao7~! interchanges the numbers 1 and 3, leaving 
2 fixed, and therefore does not belong to H. Thus, we get TH # Hr, and H 
cannot be normal in 63. 


1.2, Exercise 2. To begin with, let us assume that N is a subgroup in G, not 
necessarily normal. Then 7,: G —> G, a ++ ga, the left translation by an 
element g € G, maps left cosets of N onto sets of the same type. Thereby 7, 
gives rise to a map T,: X —+> X that is characterized by aN +> gaN. Since 
gaN = ga'N for a,a’ € G implies aN = a’N, we conclude that 7, is injective. 
On the other hand, T, is surjective as well, since T, is surjective. Thus, T, is 
bijective and thereby satisfies T, € S(X). Furthermore, g + 7, defines a map 
yp: G —+ S(X), which is even a group homomorphism, due to the relation 
Tag! = Ty ° Tg for g,g' € G. Let us look at the kernel of y. Indeed, an element 
g € G is contained in ker y if and only if T,: X —>+ X is the identity map, i-e., 
if and only if gaN = aN for all a € G. This equation is equivalent to ga € aN, 
resp. g € aNa‘', so that we obtain ker p = (),<gaNa7'. Now, if we assume 
that N is normal in G, this implies aNa~t = N for all a € G and therefore 
ker yy = N. Thus, writing G = y(G), we have constructed for every normal 
subgroup N C G a group G together with a surjective group homomorphism 
p: G — G satisfying ker p = N. 

We could view G as “the” factor group of G modulo N. In particular, 
this makes sense if we understand by G the specific subgroup y(G) C S(G) 
constructed above. However, a slightly more general point of view is more ap- 
propriate, namely to refer to an arbitrary pair (G,p), where p: G —> G isa 
surjective group homomorphism with kernel ker p = N, as a “factor group” of G 
modulo N. For such a homomorphism p: G —> G we can prove the fundamen- 
tal theorem on homomorphisms 1.2/6 in the same way as we did in Section 1.2 
for the explicitly constructed surjective group homomorphism 7: G —> G/N. 
As a consequence, all “factor groups” (G, p) are canonically isomorphic, in par- 
ticular, isomorphic to the explicitly constructed “factor group” (G/N, 7). 

1.3, Exercise 1. First observe that the law of composition “o” is commutative. 


To verify the associativity, consider elements a,b,c € Gm. According to the 
definition of “o”, there are integers q,q' € Z satisfying 
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a+b=qm+(aob), (aob)+c=q'm+ ((a0b) oc), 


which yields 
a+b+c=(q+q)m+ ((a0b) oc). 


Therefore, (ao b) oc is the remainder of a+ b+ c on division by m. In the same 
way we see that also ao (boc) equals the remainder of a + 6 +c on division 
by m. Hence, we get (ao b)oc=ao (boc), and the law of composition “o” is 
associative. The remaining axioms are easily checked. 0 serves as a zero element 
of the law “o”, and m—a, for a € Gp, a 4 0, is an inverse of a. Thus, Gy, is a 
commutative group. 

To exhibit an isomorphism between G,, and Z/mZ, consider the bijection 
t: Gm — Z/mZ, a+ a+mdZ. Since the integers aob and a+b for a,b € Gy, 
can differ only by a multiple of m, we get (ao b) +mZ = (a+b) +mZ and 
hence u(ao b) = v(a) + 2(b). It follows that v is a group isomorphism. 


1.3, Exercise 2. Look at the epimorphism 7: Z —> Z/mZ that is given 
by a ++ a+m2Z, and observe that the preimage 7~!(H) of any subgroup 
H C Z/mZ yields a subgroup of Z containing mZ. On the other hand, the 
image 7(/H) of any subgroup H C Z is a subgroup in Z/m2Z. In this way, it is 
easily seen that H +> 7~1(H) gives rise to a bijection between the subgroups 
H C Z/mZ and the subgroups H Cc Z containing mZ. 

Let us determine all subgroups H Cc Z containing mZ. If H is such a 
subgroup, it is cyclic by 1.3/4, say H = dZ. Furthermore, the inclusion mZ Cc dZ 
shows that that m admits a factorization m = cd for some c € Z, so that 
d divides m. On the other hand, every divisor d of m leads to an inclusion 
mZ C dZ. Therefore, the subgroups in Z that contain mZ correspond bijectively 
to the groups of type dZ, where d divides m. Since the generating element d of 
such a subgroup dZ C Z is unique up to sign, these groups correspond bijectively 
to the positive divisors of m. 

Now, to specify all subgroups of Z/mZ, we have only to apply the epimor- 
phism 7 to the subgroups dZ C Z just considered, where d varies over the 
positive divisors of m. Since dZ is cyclic with generating element d, its image 
m(dZ) is cyclic as well, generated by m(d) = d+ mZ. The order of the latter 
group equals 4, so that the index of 7(dZ) in Z/mZ is d; cf. 1.2/3. Therefore, 
we can state that for every positive divisor d of m, there is a unique subgroup 
H Cc Z/mZ of index d, namely the cyclic subgroup generated by d+ mZ, and 
there are no further subgroups in Z/mZ. Relying on the fact that every cyclic 
group of order m is isomorphic to Z/mZ, we can also state more generally that 
for every cyclic group of order m, there exists a unique subgroup of index d 
for every positive divisor d of m and, in view of 1.2/3, a unique subgroup of 
order d. 

Finally, let us point out that the preceding result can also be obtained in a 
direct way, without looking at corresponding subgroups in Z, if we are willing 
to use the ideal-theoretic characterization of the greatest common divisor of 
integers in Z. A basic step of proof consists in showing that a given subgroup 
H C Z/mZ is generated by the residue class d of some divisor d of m. To justify 
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this, consider elements a1,...,a, € Z, whose residue classes @,...,@, € Z/mZ 
generate the group H. If d is the greatest common divisor of a1,...,a@,,m, then 
there is an equation of type d = cya, +... + ¢,a, + cm for suitable coefficients 
C1,---,Cp,€ € Z; see, for example, 2.4/13. From this we can conclude that H is 
already generated by the residue class d of d. 


2.1, Exercise 1. We obtain 0-a+0-a = (0+0)-a = 0-a and therefore 
0-a=0 for alla € R by applying the distributive law. In a similar way, we get 
a-b+(—a)-b= (a+ (—a))-b6=0-6=0. Hence, (—a) - b is an inverse of a - b 
with respect to addition, and we see that (—a)-b = —(a- b). 


2.1, Exercise 2. The construction of polynomial rings R[X] in 2.1 did not 
require R to be commutative. Therefore, we can consider the polynomial ring 
R[X] over an arbitrary not necessarily commutative ring R. Observe that the 
resulting multiplication on RLX ] satisfies aX = Xa for all elements a € R, ie., 
that X commutes with the elements of R. In addition, if R C R’ is an extension 
of not necessarily commutative rings, we can evaluate polynomials of R[X] at 
elements x € R’ as usual, by substituting the variable X by x. The addition on 
R[X] is compatible with such a process, i.e., we get (f+g)(a) = f(x) +g(2) for 
fig © RLX] and x € R’. However, as a rule, the corresponding multiplicative 
equation (f+g)(a) = f(x)- g(a) is valid only if x commutes with the elements of 
R, in the sense that az = xa for all a € R. For example, thinking of the trivial 
extension R C R and taking x € R, this is the reason why polynomial rings, as 
introduced in 2.1, should be considered only over a commutative coefficient ring 
R. On the other hand, if we want to evaluate polynomials of R[X] on a ring 
R’ strictly extending R, it is not really necessary for R’ to be commutative. As 
indicated before, it is enough that the elements of R commute with those of R’. 


2.2, Exercise 1. From a = )%j", Ra; and b = )7_, Rb; we get immediately 
a+b = Soi", Ra; +) 75-1 Rb; and thereby see that a1,...,@m,b1,...,bn generate 
the ideal a + 6. Next we want to show that the elements a,;b;, 7 = 1,...,m, 
j=1,...,n, give rise to a system of generators of a- 6. Let q be the ideal that 
is generated by these elements. Since a;b; € a-6 for all 7 and 7, we get q C a- 6. 
To verify the reverse inclusion consider an element z € a:b. Then z is a finite 
sum of type z = )>) ay) for suitable elements a, € a, 3, € b, and there are 
elements cyi,d,; € R such that a, = )7i",c\a; as well as By = ae dy;b;. 
However, this implies a,(3, = ae Cyidyja;b; € q and hence z € q. Therefore, 
q =a-6, and the elements a;b; generate the ideal a- b. 

Concerning the ideal aM b, it is more complicated to construct a system of 
generators from the elements a; and 6;. Let us consider the case R = Z as an 
example. Then a is generated by the greatest common divisor a of the elements 
a;, and likewise 6 by the greatest common divisor 6 of all b;; for example, 
assertions of this type are proved in 2.4/13. Furthermore, the ideal af 6 is 
generated by the least common multiple of a and 0; cf. 2.4/13 again. However, 
such a description of a generating element of aM 6 is valid only for principal 
ideal domains. In more general settings it is totally unclear how to arrive at a 
system of generators for aM b. 
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2.2, Exercise 2. Let a, 6 be ideals of a ring R. We claim that aUb is an ideal in 
Rif and only if a C b or b Ca. If one of these inclusions holds, say a C 6, then 
aUb = 6 is clearly an ideal in R. On the other hand, if a ¢ b and b C a, then 
there exists an element a € a that is not contained in 6, as well as an element 
b € 6 that is not contained in a. This implies that a+ b is contained neither in 
a nor in b. Consequently, aU b is not closed with respect to addition and hence 
cannot be an ideal in R. This justifies our claim. 

It is not easy to decide for a family (a;)je7 of ideals in R consisting of more 
than two elements whether the union a = U<; a; is also an ideal. Certainly, a 
is closed under the multiplication by elements of R, as well as closed under the 
formation of inverses with respect to addition. Therefore, it remains to check 
whether a is closed under addition, i.e., whether a,b € a always implies a+b € a. 
For example, a sufficient condition for this is that for any two indices i,7 € I 
and elements a € a;, b € a;, there is always an index k € J such that a,b € dg. 
In particular, the union of an ascending chain of ideals a; C ag C ... is again 
an ideal. 


2.2, Exercise 3. It is quite easy to determine the ideals in K?. We claim that 
except for 0, K x 0, 0 x K, K?, there are no further ideals in K?. Indeed, 
consider an ideal a C K?. If a contains an element (a,b), where a £4 0 ¥ b, then 
(1,1) = (a1, b-1)(a, b) € a, ie., a contains the unit element of K?, and we have 
a= K?. Next, if a does not contain an element (a,b) such that a 4 0 4 b, then 
a consists only of elements of type (a,0) or (0,6). Since (a,0) + (0,6) = (a,b), 
we see that nontrivial elements of type (a,0) and (0,6) cannot be present in a 
at the same time. Therefore, let us assume that all elements of a are of type 
(a,0). Then a is either the zero ideal, or there is an element (a,0) in a such 
that a # 0. In the latter case we get (1,0) = (a~',1)(a,0) € a and therefore 
a=k x0. 

In particular, we can see that all ideals are subvector spaces in K?. There is 
a general reason for this. Indeed, considering the so-called diagonal embedding 
Kk —> K?, a> (a,a), we may identify K with its image A in K? and thereby 
view K = A as a subring of K?. Then the product av in the sense of K? as a 
K-vector space coincides for elements a € K and v € K? with the product av 
of the ring multiplication in K?. Since ideals are closed under multiplication by 
elements of K?, we see once again that every ideal in K? is a K-subvector space 
in K?. Since the same is true for every subring of K? containing the diagonal 
A, we conclude by reasons of dimension that there cannot exist such subrings, 
except for A and K? themselves. Also we see that A is an example of a subvector 
space in K? that does not admit the properties of an ideal. Furthermore, A is 
the only subvector space that is at the same time a subring in K?. 

If K consists of more than two elements, one can show that K? contains 
further subvector spaces beyond the ones already mentioned. In general, there 
will exist further proper subrings in K? besides A, in particular, subrings that 
are not contained in A. 


2.3, Exercise 1. Certainly, the image y(a) of an ideal a C R is a subgroup 
of R’. But it is not an ideal in general, since y(a) does not need to be closed 
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with respect to multiplication by elements of R’. For example, consider the 
ring homomorphism Z — Q and observe for m > 1 that mZ is an ideal in 
Z, but not in Q. Indeed, Q is a field and therefore contains only the trivial 
ideals. The situation is different if we assume y: R —> R’ to be surjective. 
In such a case the image y(a) of an ideal a C R is always an ideal in R’. 
To verify that y(a) is closed with respect to multiplication by elements of R’, 
consider elements r’ € R’, a’ € (a), as well as preimages r € R, a € a. 
Then we have ra € a and hence r’a’ = y(ra) € y(a). Let us examine the 
circumstances under which the ideal y(a) is prime or maximal in R’. To do 
this, look at the composition 7: R —> R' —> R'/y(a) of y with the canonical 
projection R’ —+ R’/ (a), of course under the assumption that ¢ is surjective. 
As a composition of surjective ring homomorphisms, w is again a surjective ring 
homomorphism. Its kernel equals a + ker y, so that R’/y(a) is isomorphic to 
R/(a+ker y) by 2.3/5. Therefore, using 2.3/8, it follows that y(a) is prime (resp. 
maximal) if and only if R’/y(a) is an integral domain (resp. a field), i-e., if and 
only if a+ ker y is prime (resp. maximal) in R. In particular, the image y(a) of 
a prime (resp. maximal) ideal a is prime (resp. maximal) if a contains ker y. 

Next we look at the preimage a = y '(a’) of an ideal a’ C R’, for an 
arbitrary ring homomorphism y: R —> R’. It is easily verified that a is an 
ideal in R. Indeed, to show that a is closed under multiplication by R, consider 
elements r € R and a € a. Then we have y(ra) = y(r)y(a) € a’ and hence 
ra € » ‘(a’) =a. To deal with the question whether a is prime or maximal in 
R, consider the composition 7): R —+ R’ —> R'/a’ again, which now satisfies 
ker y) = a. We conclude from 2.3/4 that w induces an injective homomorphism 
w: R/a —>+ R'/a’. Now if a’ is prime in R’, we see from 2.3/8 that R’/a’ is an 
integral domain. Then the same is true for R/a, and a is a prime ideal in R. 
However, the same argument does not work for maximal ideals instead of prime 
ideals, since the map 7 does not need to be surjective. Indeed, the preimage 
aC R of a maximal ideal a’ C R’ is not necessarily maximal. Just consider the 
inclusion map Z <> Q, as well as the ideal a’ = 0. It is maximal in Q, but its 
preimage a = 0 is not maximal in Z. 

In particular, if y is surjective, the argument given above shows that the 
ideals (resp. prime ideals, resp. maximal ideals) in R’ correspond bijectively to 
the ideals (resp. prime ideals, resp. maximal ideals) in R that contain ker y. 


2.3, Exercise 2. We claim that ker y, equals the principal ideal (X — x) that 
is generated by X — x. Clearly, we have X — x € ker y,. On the other hand, 
given any element f € ker y,, we may apply Euclidean division 2.1/4 and write 
f =qdX —2) +r for a polynomial r € RLX] of degree < 1, i-e., for a constant 
r € R. However, since yz(r) = yz(f) = 0, this means that r = 0 and therefore 
f € (xX — 2). Thus, ker y, = (X — 2). 

Using the surjectivity of y,, the fundamental theorem 2.3/5 on homomor- 
phisms yields an isomorphism R[X]/ ker y, “> R. Furthermore, 2.3/8 shows 
that ker y, is prime if and only if R is an integral domain, and maximal if and 
only if R is a field. 
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2.4, Exercise 1. Let R be a ring. We want to show that the polynomial ring 
R[X] is a principal ideal domain if and only if R is a field. The condition is 
sufficient, as we have seen in 2.4/3. To show that it is necessary as well, assume 
that R[X] is a principal ideal domain. In particular, RLX] is then an integral 
domain, and the same is true for R. Furthermore, observe that the element X 
is irreducible in R[X]. Indeed, consider a factorization X = fg for polynomials 
f,g © R[X]. Then 2.1/2 yields deg f + deg g = 1, say deg f = 0 and deg g = 1. 
Therefore, the polynomial f is constant, i.e., is given by an element in R, and the 
product of f with the coefficient of degree 1 in g equals 1, due to the equation 
X = fg. However, this shows that f is a unit in R and R[X], so that X is 
irreducible in RX]. 

Now consider the substitution homomorphism y: R[X] — R, h+- h(0). 
It is surjective and satisfies kery = (X). Therefore, it gives rise to an iso- 
morphism R[X]/(X) ~ R, due to the fundamental theorem on homomor- 
phisms 2.3/5. Since X is irreducible, we conclude from 2.4/6 that the ideal (X) 
is maximal in R[X]. But then RLX]/(X) ~ RB is a field by 2.3/8. 


2.4, Exercise 2. Let R be a unique factorization domain. If the ideal gener- 
ated by two elements x,y € R is always principal in R, then, by an inductive 
argument, every finitely generated ideal in R is principal. Furthermore, using 
the fact that R is a unique factorization domain, we can conclude that every 
ideal in R is principal and hence that R is a principal ideal domain. Indeed, if 
there exists an ideal a C R that is not finitely generated, one can find in aa 
sequence of elements aj, d2,... giving rise to a strictly ascending chain of ideals 


(a1) © (a1, a2) € (ay, a2, a3) ©... . 


Since each member of this chain is finitely generated and hence principal, say 
generated by some element x;, we can write the chain just as well as 


(#1) S (@e) S (@s) Gace, 


where in each case x;41 1s a nontrivial divisor of x;. This implies that the number 
of prime factors in the prime factorization of 24, will be strictly smaller than 
the corresponding number of prime factors for x7;. As a consequence, there can- 
not exist an infinite chain of the above type. Hence, every ideal of R is finitely 
generated, and therefore principal, so that R is a principal ideal domain. In par- 
ticular, we thereby see that the ideal-theoretic characterization of the greatest 
common divisor is generally valid only in principal ideal domains. 

The situation is different for the least common multiple v of two elements 
x,y € R. Indeed, we get (x) 1 (y) = (wv), even if R is just a unique factorization 
domain. This is easy to justify. Since v is a multiple of x and y, we obtain 
(x) (y) D (v). On the other hand, if a € (x) N (y), ie, if @ is a common 
multiple of « and y, then a is a multiple of v by its definition, and we get 
a € (v), resp. (2) MN (y) C (v). 

2.5, Exercise 1. Let R be a (commutative) ring and M a not necessarily 
commutative monoid. Then the polynomial ring R[M/] can be constructed as 
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in 2.5, since the commutativity of M was not really used. However, care is 
required if the law of composition on / is still written additively. Indeed, if M 
is not commutative, there are elements y,v € M such that wt+v Av+yp. In 
particular, the product X“.X” = X“*” differs from the product X”-X4 = X’t# 
then, so that R[M] is a ring that is no longer commutative. In accordance with 
this, one should not limit 2.5/1 to commutative extension rings R’ of R. Instead, 
one can admit more general rings R’ whose elements commute with those of R. 
The assertion of 2.5/1, including its proof, remains valid without changes. 


2.5, Exercise 2. The results 2.5/2, 2.5/3, and 2.5/4 remain valid without 
changes if we replace R[X1,..., Xn] by the polynomial ring R[X] in an arbi- 
trary system X = (X;)icr of variables X;. The argument is that each element of 
R[X] is, in fact, a polynomial in finitely many of the variables X;. Therefore, 
it is enough to know the corresponding results for polynomial rings in finitely 
many variables. For example, consider the assertion of 2.5/4. First we see that 
R* c (R[X])* as usual. On the other hand, if f is a unit in R[X], there is a 
polynomial g € R[X] such that fg = 1. Since both f and g are polynomials in 
finitely many variables, we can read the equation fg = 1 just as well in a subring 
of type RLX;,,...,X;,] C R[%X]. In particular, f is a unit in RLX;,,...,Xi,]. 
However, in this case we know already that f is a unit in R. 

Also 2.5/5 can be generalized to systems of arbitrarily many variables: Given 
aring homomorphism y: R —+ R’ and a system (x;)ic; of elements in R’, there 
is a unique ring homomorphism ®: R[X;; i € I] —> R’ satisfying ®|p = y 
and @(X;) = a; for all i € J. It is possible to deduce this assertion from 
2.5/5, by considering extensions of y of type R[X;,,...,Xi,] —> R’, where 
Xj, > v;,, and using their uniqueness. However, it is more natural to observe 
that a monoid homomorphism N“) —+ R’ is uniquely determined by the images 
of the elements e; = (di;)ier, 7 € I, and that these images are not subject to 
any restrictions. Then one can apply 2.5/1. 


2.5, Exercise 3. We proceed by a repeated application of 2.5/1. To begin with, 
let &': RLM] — RLM x M’) be the ring homomorphism that is given by the 
canonical map R > R[M x M'] in conjunction with the monoid homomorphism 
M — R[M x M’], » +> X™°). Furthermore, there is a homomorphism 
®: R[M][M'] — RLM x M'] that extends & and otherwise corresponds to 
the monoid homomorphism M’ —>+ R[M x M’], v —+ X@”). On the other 
hand, we can define a ring homomorphism ¥: R[M x M'] —> RLM][M’‘] by 
the canonical map R > R[M] © R[M][M'] and the monoid homomorphism 
M x M' — R[M][M’‘], (u,v) X"-X”. We claim that @ and W are inverse 
to each other, i.e., that @oW = id and Wo @ = id. Indeed, o W and the 
identity map constitute two ring homomorphisms R[M x M'] —> RLM x M‘} 
that extend the canonical map R<> RLM x M’] and satisfy X“%") Hs XxX), 
thereby corresponding to the monoid homomorphism M x M’ —> R[M x M"], 
(u,v) + X™”), Then the uniqueness assertion in 2.5/1 shows that BoW = id. 
In a similar way one obtains Y o @ = id, first restricted to R[M], and then on 
all of R[LM][M’]. 
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2.6, Exercise 1. Let n > 1. Concluding by induction on n, we write the 
polynomial f € K[X,,...,X,] under consideration as a polynomial of type 
f =>o% f.Xi, with coefficients given by polynomials f; € K[X1,...,Xn-1]. 
Then, for points x = (2,...,¢,) € K", we get f(z) = °°, fi(x’)ai,, where 
wv = (#,...,%n-1). Assuming f(x) = 0 for all 2 € K”, the polynomial 
oy fi(x’)X? € KLX,] vanishes for every x’ € K"~! on all of K. Thus, we 
can apply 2.6/1 to conclude that f,(a’) = 0 for all i € N and all 2’ € K™"'. In 
particular, using the induction hypothesis in case n > 1, we get f; = 0 for all i 
and therefore f = 0. 


2.7, Exercise 1. First observe that the image y(p) of a prime element p € R is 
again a prime element. Therefore, if 2 = p,...pp is a prime factorization of an 
element « € R, then v(x) = y(p1)... ~(pp) is a prime factorization of its image 
v(x). In particular, we get vyip)(p(x)) = v(x) for x € R. Let us show that more 
generally, the equation 1p) (@(f)) = v(f) holds for all prime elements p € R 
and all polynomials f € R[X]. Considering besides @ also its inverse @~', it is 
enough to show for polynomials f #4 0 that v,(,)(@(f)) > vp(f) for all p. Now, 
if y»(f) =r = 0, we can view f =p" f asa polynomial in RX]. Furthermore, 
5(f) € R[X] implies v)(4(f)) > 0. Since (fF) = O(p" f) = g(py" Lf), we see 
that Vyip) (@(f)) = r = %»(f), as desired. Under the condition that y(p) is always 
associated to p, for example if y = ®|p = id, we get even v,(@(f)) = v,(f) for 
all prime elements p € R. 

A polynomial f € R[X] is primitive if and only if v,(f) = 0 for all prime 
elements p € R. Since ¢ is an isomorphism of R, it induces a bijection on the 
set of all classes of associated prime elements. Therefore, using the equality 
Vo(p) (P(f)) = Yp(f) for all p, we conclude that &(f) is primitive if and only if 
f is primitive. The automorphism ®@: R[X] —> RX], f > f(X +a), may 
serve as an example. Thereby we see that a polynomial f € R[X] is primitive 
if and only if f(X + a) is primitive. 


2.7, Exercise 2. Gauss’s lemma establishes for prime elements p € R and 
polynomials f,g € A[X] the formula vp(fg) = vp(f) + (9). If f,g 4 0, we 


can read from it 
Ile" = II pre) j II prr(9), 


pEeP pEeP pEeP 


in other words, the formula ay, = ay - a, for the content. Conversely, departing 
from this formula, it implies v,(asg) = Up(ar) + Yp(a,) for p € P. Since the 
content a), of a polynomial h # 0 is characterized by the relation v,(a;,) = v,(h), 
we get again v,(fg) = (ff) + v»(g). Therefore the assertion of Gauss’s lemma 
is equivalent to the formula ar, = af: ag for f,g # 0. 


2.9, Exercise 1. If M = T @ F is a decomposition into a torsion module T 
and a free module F’, then T is unique, since it will coincide with “the” torsion 
submodule of M. On the other hand, F' is not unique, except when T = 0 or 
T = M. Indeed, if we modify certain elements of a basis of F' by adding arbitrary 
torsion elements, the new system will generate a free submodule F’ Cc M that 
is different from F’ but satisfies T 6 F’ = M as well. 
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Similarly, there is no uniqueness for decompositions of type M = M'@ M", 
where M’ ~ A/p"A and M" ~ A/p*A, for a prime element p. For example, if 
r = s = 1, we can view M as an (A/p)-vector space. Then M = M' @ M” 
is a direct sum decomposition of a 2-dimensional (A/p)-vector space into two 
1-dimensional subspaces. However, such a decomposition is never unique. 


2.9, Exercise 2. We claim that Q is a torsion-free Z-module of rank 1 that 
is not free. Indeed, Q is of rank 1, and if it were a free Z-module, it would be 
generated by a single element x € Q, say x = § with relatively prime integers 


a,b € Z, a,b #0. Then, however, $ cannot be contained in Zz. 


2.9, Exercise 3. Let K be a field and V a finite-dimensional K-vector space, 
together with a K-endomorphism y: V —> V. A subvector space U C V 
is called y-invariant if p(U) C U. It is called y-cyclic if U is y-invariant 
and if there is a vector u € U such that the sequence u, y(u), y?(u),... gives 
rise to a K-generating system of U. Furthermore, U is called y-irreducible if 
U is y-invariant and cannot be decomposed into a direct sum of two proper 
y-invariant subvector spaces. As main results, the theory of canonical forms 
shows that V decomposes into a direct sum of y-irreducible subvector spaces 
and that each y-irreducible subvector space is y-cyclic. We will deduce these 
assertions from 2.9/8; see also [4], 6.3-6.5. 

In order to apply the methods of 2.9, we view V as a K[_X |-module under 
vy, by defining the multiplication by X on V through the application of —y; see 
the explanations given in 2.9. Then a K[X]-submodule U C V is the same as 
a y-invariant K-subvector space. Similarly, a A LX |-submodule generated by a 
single element is just a y-cyclic K-subvector space. In this way, a y-irreducible 
subvector space of V is recognized as a K[X]-submodule of V that cannot be 
written as a direct sum of two proper K LX ]-submodules. 

Since V is finitely generated as a K-vector space, the same is true for V 
as a K|X]-module. Furthermore, by reasons of vector space dimension, V is 
a K[X]-torsion module, so that 2.9/8 can be applied. Fixing a system P of 
representatives of the prime polynomials in A[X], we obtain a decomposition 


V~ BOQ AX/0"") 


pEP vp=1 


with unique integers r,,n(p,v»>) € N, where r, is zero for almost all p € P. 
Using the language of vector spaces, this is a decomposition of V into a direct 
sum of y-cyclic subvector spaces. Furthermore, one concludes from the unique- 
ness assertion in 2.9/8 that the occurring subspaces are even y-irreducible and 
in addition, that every y-irreducible subspace is y-cyclic. This establishes the 
results on canonical forms mentioned above. 

One can now consider special matrices that describe the endomorphism y 
with respect to appropriate K-bases of V. To do this, look at a decomposition 
V = @;_, Vi into ~-irreducible subvector spaces, choose a basis on each V;, and 
consider on V the basis that is composed of the chosen bases on the V;. Then the 
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corresponding matrix of vy is of “diagonal shape,” in the sense that on its “diago- 
nal” there are the matrices corresponding to the endomorphisms y; = »ly,, with 
zero entries otherwise. Therefore, it is enough to assume that V is y-irreducible, 
say V = K[X]/(p”) for a prime element p € P. If X € K[X]/(p") denotes the 
residue class of X, then the elements 1,X,X?,...,X™~!, for m = n- (deg p), 
form a K-basis of V, and the corresponding matrix of y is determined to be 


0 0 0 O —-—Cm 
1 0 O 0° S64 
0 0 1 0 —C2 
0 0 O ol —Cy 


where p? = X™+cX™1!4+4...+ Gm is the minimal polynomial of y. This 
is the rational canonical form, also referred to as the Frobenius normal form 
of y. Especially, if p is of degree 1, say p = X — c, we can take just as well 
1,X —c,(X —c)?,...,(X —c)""! as a K-basis of V. Then the corresponding 
matrix of vy is of shape 


HO 
e) 

oo 
oo 


0 oO ... e O 
0 oO... 1 ¢ 


and we obtain the Jordan canonical form of y. 


3.1, Exercise 1. Let ¢: R —> R’ be a homomorphism between two integral do- 
mains R, R’ of characteristic p, resp. p’. Then the homomorphisms y: Z —> R, 
nt>n-1, and gy’: Z —+ R,n -> n-1, satisfy kerry = pZ, as well as 
ker vy’ = p'Z. Now observe that there is a unique ring homomorphism from Z to 
R’. Therefore, we get y’ = a0 vy and hence ker y C ker y’, which implies p’ | p. 
Furthermore, we even have ker y = ker y’, and hence p = p’, if o is injective. In 
particular, since homomorphisms of fields are always injective, it follows that 
there cannot exist homomorphisms between fields of different characteristic. 

On the other hand, the canonical map Z —> Z/p'Z for p’ prime yields an 
example of a homomorphism between integral domains of characteristic 0 and p’. 
However, this is the only case of “mixed” characteristic that can occur. Indeed, 
if ¢: R —>+ R’ is a homomorphism between integral domains of characteristic p 
and p’, then p'|p, as we have seen. Since p and p’ are (positive) prime numbers 
if they do not vanish, we necessarily get p = 0 for p £ p’. 


3.2, Exercise 1. We have to consider a field extension L/K, and two elements 
a,b € L that are algebraic over kK. 'To show that a+ b is algebraic over K, we 
could try to explicitly construct from the two minimal polynomials of a and ba 
nontrivial polynomial admitting a+b as a zero. However, experience shows that 
such a procedure is not really practicable. A superficial reason for it consists in 
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the fact that in looking at an expression f(a+ 0) for a polynomial f € AK [|X] of 
degree > 2, the quantities a and b cannot be “separated” in general, for instance 
by writing f(a +) as a sum of a polynomial in a and a polynomial in b. Just 
consider the field extension C/Q and the algebraic numbers a = J/2 and b= V3 
as an example. The minimal polynomial of a is X? — 2, and that of b is X? — 3. 
Using the method of Exercise 7 in Section 3.2, the minimal polynomial of a+ 6b 
is determined to be X4—10X?+1. This is a polynomial for which no “obvious” 
relationship to the minimal polynomials X? — 2 and X? — 3 is visible. 

Thus, to verify the algebraicity of a + b, there is no other way than to 
apply the theory developed in Section 3.2. We know from 3.2/6 that K(a)/K 
and K(a,b)/K(a) are finite field extensions. Then K(a,b)/K is finite by the 
multiplicativity formula 3.2/2 and hence algebraic by 3.2/7. In particular, the 
sum a+b € K(a,b) is algebraic over K. 


3.2, Exercise 2. We have shown in 3.2/7 that every finite field extension is 
algebraic. On the other hand, the example of the algebraic closure Q of Q in 
C shows that the converse of this result is not true. However, we can say that 
a field extension L/K is algebraic if and only if there is a family (Lj);er of 
intermediate fields of L/K such that L = U,-,; Li and L;/K is finite for every i. 
Indeed, if this condition holds, then there exists for each a € L an index 7 € I 
such that a € L;. It follows that a is algebraic over K and hence that L is 
algebraic over K. On the other hand, if L/K is algebraic, then L is the union 
of all intermediate fields K (a), where a varies over L. Furthermore, K(a)/K is 
finite by 3.2/6. 

We want to add that the algebraicity of a field extension L/K can also be 
characterized by the following condition: Every subfield L’ of L that is finitely 
generated over Kt is finite over AK. Indeed, if L/K is algebraic and if L’ is a 
subfield of L that is finitely generated over K, then L'/K is finite by 3.2/9. 
Hence, the condition is necessary. But it is also sufficient. For every a € L, the 
field extension K'(a)/K is simple and hence finitely generated. If it is finite in 
each case, it is algebraic by 3.2/7, and we see that L/K is algebraic. 


3.2, Exercise 3. Consider an element a € C that is not contained in the 
algebraic closure Q of Q in C. If a is not transcendental over Q, it is algebraic 
over Q. However, then a is algebraic already over Q by 3.2/12 and hence must 
belong to Q, in contradiction to the choice of a. 


3.3, Exercise 1. Assume A ¥ 0 and hence B ¥ 0. As usual, given b € B, we 
consider the ring homomorphism y: A[Y] —> B that extends the inclusion 
A< Band maps Y to b. Since 6 satisfies an integral equation over A, it follows 
that ker y contains certain monic polynomials; choose one with minimal degree 
among these, say f. Now, if A is a field K, then f € K[Y] is uniquely deter- 
mined by b. Indeed, ker y is a principal ideal in A [Y], and ker y is generated 
by f. Since such a generator is unique up to a unit, we see that f is unique up 
to a unit in K[Y], ie., up to a constant in K*. Therefore, assuming f to be 
monic, it is uniquely determined by 0. 
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In the general case, however, ker y does not need to be principal in A[Y ]. In 
fact, it is most likely that there will exist several monic polynomials of minimal 
degree in ker y. As a consequence, none of these can be interpreted as being 
“the” minimal polynomial of 6 over A. In particular, looking at the example 
A= {0 ¢X* € K[X]; c, = 0} C K[X] = B given in the exercise, then 


Yak. Var er a) 


are two different monic polynomials of minimal degree in A[Y] that admit 
b:= X as a zero. But none of these will generate the ideal ker vy. 


3.4, Exercise 1. We assume that the polynomial f € Q[X] is irreducible; 
otherwise, we could replace f by an irreducible factor. Then we know from Kro- 
necker’s construction, Proposition 3.4/1, that QLX]/(f) can be viewed as an 
extension field of Q and that the residue class X of the variable X is a zero of f. 
Thereby we have enlarged Q in a minimal way, so to speak, with the only intent 
to get hold of a zero of f, without relating it to the real or complex numbers. 
Studying the same problem in analysis, one constructs first of all from Q the 
field R of real numbers in terms of topological arguments, and subsequently 
the field C of complex numbers. Only thereafter is one concerned with zeros 
of polynomials in these special fields. The construction of such zeros uses in a 
fundamental way approximation and limit techniques, since the defining prop- 
erties of R and C must come into play, in particular the completeness. Finally, 
if a zero a € C of f has been exhibited, the homomorphism Q —> C can be 
extended by 3.4/8 to a homomorphism Q[X]/(f) —> C, mapping X to a. 


3.4, Exercise 2. In order to apply Zorn’s lemma one needs a partially ordered 
set. However, in general the “collection” of all algebraic extensions of K fails to 
admit the properties of a set. 

On the other hand, the proposed argument can be remedied in a certain 
sense if some set-theoretic precautions are respected. Indeed, consider the power 
set P of K, and view K as a subset of P via the map K —> P, a+ {a}. 
Let M be the set of all pairs (L,«) consisting of a set L, where K C LC P, 
and a field structure « on L extending the given field structure on KA, and 
defining L as an algebraic extension of kK. Then M is partially ordered by 
writing (L,«) < (L’,4’) if LC L’ and if «’ restricts on L to «. As usual, the 
argument on unions shows that every totally ordered subset of 1 admits an 
upper bound in M. Therefore, we can conclude from Zorn’s lemma 3.4/5 that 
M contains a maximal element. Let us denote such an element by (14, 41); it 
constitutes an algebraic extension of K. 

We claim that (11,1) is already an algebraic closure of A’, assuming that 
kK consists of infinitely many elements. To justify this we have to show that 
(L1, 1) does not admit any nontrivial algebraic extensions. Therefore, consider 
an algebraic extension E of (£1, 1), which then is algebraic over K as well. 
Now we use some facts on cardinalities of sets, namely that AK and KX] (for 
K infinite) are of same cardinality. Thus, they have the same cardinality as L, 
and EF, since those fields can be viewed as unions of zero sets of polynomials in 


336 Appendix 


K(X]. However, P as the power set of A admits a cardinality that is strictly 
bigger than that of K, resp. EF. Therefore, the same is true for P — Ly, and it 
follows that the inclusion L; — P can be extended to an inclusion E <4 P. As 
a result, we get an element (Lo, 2) € M such that (£1, #1) < (Le, kg). But then 
the maximality of (Zi, 1) yields L,; = Le, resp. (L1,«1) = EF, and it follows 
that (Z1, 41) is an algebraic closure of K. For finite fields K, the argument can 
easily be modified. Just enlarge A to an infinite set AK’ and define P as the 
power set of A’. 


3.4, Exercise 3. Although two algebraic closures K, and K> of a field K 
are isomorphic over K, there usually exist several different K-isomorphisms 
K, — Ko, i.e., isomorphisms leaving K fixed; cf. 3.4/8 in conjunction with the 
construction process in the proof of 3.4/9. If we would talk about “the” algebraic 
closure K of K, we would allege an identification of all possible algebraic closures 
of K. Indeed, given two such closures K; and Ke we would have to fix a special 
isomorphism y;;: A; 4 K,; such that yij|« = idx, where the compatibility 
relation Qik = Yjr° Yi for any three indices 7, j,k must be respected. However, 
since canonical choices for such K-isomorphisms do not exist, except for trivial 
cases, an identification of all algebraic closures of K is rather problematic. 


3.5, Exercise 1. Let L/K be a field extension of degree 2. Choosing an element 
a € L—K, we get 1 < [K(a): K] < 2 and hence [K(a) : K] = 2, which means 
that L = K(a). Let f € K[X] be the minimal polynomial of a over K; it is of 
degree 2. Then a is a zero of f, and the linear factor X — a divides f in L[X]. 
In particular, f decomposes completely over L into linear factors. Let a,b € L 
be the two zeros of f. Since L = K(a) = K(a,b), we see that L is a splitting 
field of f over K and hence that it is normal over K. 


3.5, Exercise 2. We are considering a splitting field L of a nonconstant poly- 
nomial f € K[X]. Furthermore, let g € KX] be an irreducible polynomial 
admitting a zero b in L. To see that LD contains all possible zeros of g, choose 
an algebraic closure L of L and let b,...,b, € L be the distinct zeros of g. 
Then there is for each i = 1,...,r a K-homomorphism o;: K(b) —> L such 
that o;(b) = b;, see 3.4/8, and we can extend o; by 3.4/9 to a K-homomorphism 
oj}: L —>+ L. 

It is enough to show that oj/(L) C L for i = 1,...,1r, since then all zeros 
bj,...,0, of g are contained in L, and hence g decomposes over L into linear 
factors. Since o} leaves the field K fixed, it maps zeros of f to zeros of f again. 
However, as L is generated over K by all zeros of f in L, we get o/(L) C L, as 
desired. 


3.5, Exercise 3. Choose an algebraic closure K of L. Then K is an algebraic 
closure of K as well, since L/K is algebraic. Now consider an element a € K with 
minimal polynomial f € A LX]. Since f is nonconstant and since L is a splitting 
field of all nonconstant polynomials in KX], it follows that f decomposes 
completely over L into linear factors. Thus, a € L and L = K, ie., L is an 
algebraic closure of K. 
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3.6, Exercise 1. We proceed aah as in 3.2, Exercise 1. Since a € L is 
separable over A, we read [K (a): K], = [K(a) : K'] from 3.6/6. Furthermore, 
b € Lis separable over K and in partienlar over K (a), so that we get in the same 
way [K (a,b) : K(a)], = [K(a,b) : K(a)]. Then we apply the multiplicativity 
formulas op and 3.6/7 to Ben that [AK (a,b): K], = [K(a,b) : K]. To 
see that a+b € K(a,b) is separable over K, we can use the implication from 
(iii) to (i) in 3.6/9. Alternatively, to give a more elementary argument, look at 
the extensions K C K(a+b) C K(a,6). Then 


[K(a, tie = [K(a,b): K(a+b)] -[K(a +b): K], 
[K (a,b): K], = [K(a,b): K(a +)],- [K(a +) : K]. 


Both terms on the left-hand side coincide. Since the separable degree equals at 
most the usual degree, see 3.6/6, the corresponding terms on the right-hand side 
must coincide as well, so that in particular, [AK (a+b): K],=[K(a+6): kK]. 
But then, again by 3.6/6, we conclude that a+ 0 is separable over K. 

The same argument can be applied to the elements a — b, ab, and ab~! if 
b £ 0. In this way, it follows that the elements of L that are separable over 
form an intermediate field of L/K. 


3.6, Exercise 2. Given two algebraic closures K, and K» of K, there is a 
K-isomorphism a: kK’, > Ko according to 3.4/10. Let f be a monic polynomial 
in KX] and consider its factorizations 


f=[[x-a" f=] fx- 


i=l =1 


— 


into powers of distinct linear factors in K[X], as well as in K»[X]. Then, using 
the uniqueness of prime factorization, 0 transports the first decomposition into 
the second. In particular, we have m = n, and up to a renumbering of the 6; 
even o(a i) = =); fori=1,...,m, as aol as r; = s;. Therefore, f admits multiple 
zeros in A, if and only if this’ is the case in K. 


3.6, Exercise 3. Let L/K bea finite separable field extension; we are interested 
only in the case that kK consists of infinitely many elements. Using recursion, 
we can restrict ourselves to the case L = K(a,b). Let f and g be the minimal 
polynomials of a and 6 over K, and let L’ be a splitting field of f, g over L. Then 
L’ is also a splitting field of f,g over K, and in fact a normal closure of L/K; 
cf. 3.5/7. In the proof of 3.6/12 we considered all K-homomorphisms oj,..., On 
from L into an algebraic closure K of K. However, since we can assume L' Cc K, 
it follows in this case from 3.5/4 that the images of the o; are already contained 
in L’. In other words, it is enough to choose a normal closure L'/K of L/K 
and to consider all K-homomorphisms oj,...,0, from L to L’. Then, looking 
for an element c € K such that o;(a+ cb) £ o;(a+ cb) for all i F 7, we get 
K(a,b) = K(a+ cb). 


3.7, Exercise 1. Assume that the elements a,b € L are purely inseparable 
over K. This means by 3.7/2 that there exist equations a?” = c and b?" = d for 


338 Appendix 


some elements c,d € K. Taking a suitable p-power of one of these equations, 
we may assume m = n. Then the binomial formula 3.1/3 yields the relation 
(a +b)?" = a?" +b?" =c+d. Furthermore, since (ab)?” = cd, this shows, us- 
ing 3.7/2 again, that a+b and ab are purely inseparable over A’. Alternatively, 
we can conclude similarly as in Exercise 1 of Section 3.2, or as in Exercise 1 of 
Section 3.6. 


3.7, Exercise 2. Purely inseparable field extensions L/K are characterized 
by the equation [L : AK], = 1. Alternatively, we could write this condition as 
[L: K]; = [L: K], however, only in cases in which the degree [L : K is finite. 
Thus, if we want to work with the inseparable degree instead of the separable 
degree, we always have to restrict ourselves to finitely generated extensions, 
similarly as we did when discussing separable extensions in Section 3.6. 


3.7, Exercise 3. Let K(a)/K be asimple algebraic field extension with minimal 
polynomial f € K[X] of a over K. As in 3.6/2, we can find a polynomial 
g € K[X] and a maximal exponent r such that f(X) = g(X”’). Then g is 
separable, and furthermore, it coincides with the minimal polynomial of a?" over 
K. In particular, K (a)/K(a?’) is purely inseparable and K(a?")/K is separable. 


3.8, Exercise 1. Fields of characteristic 0 are perfect (3.6/4). The same is 
true for finite fields, or more generally, for fields that are algebraic over a finite 
field (3.8/4). Thus, in order to construct an example of a nonseparable field 
extension, we must start out from an infinite field K of characteristic p > 0 
that is not algebraic over its prime subfield F,,. The simplest example of such a 
field is the function field K = F,(t) in a variable t. Adjoining to K a pth root 
of t yields a nontrivial purely inseparable field extension of A’. Applying the 
Frobenius homomorphism this extension can be identified with F,(t)/F,(t”). 


3.8, Exercise 2. If F is a finite field of characteristic p > 0 with g = p” 
elements, then F is a splitting field of the polynomial X? — X over F,. More 
precisely, F consists of the q zeros of this polynomial. Therefore, if F is a subfield 
of a field L, it is uniquely characterized by the number of its elements. 


3.9, Exercise 1. In the beginning of Section 3.9 we did not use the fact that the 
zeros of the corresponding polynomials are considered in the affine n-space of an 
algebraically closed field. In this way, the assertion of 3.9/1 remains valid if K’ 
is replaced by K and V(-) by Vi(-). Also we can read from 3.9/2 that algebraic 
sets of type Vx(£) can always be defined through finitely many polynomials in 
K[X] and hence are of type Vx«(fi,.--, f-). Furthermore, in 3.9/3 we obtain 
the relation Vx(I(U)) = U for subsets U C K” of type U = Vx(a) and ideals 
a Cc K[X]. However, the equation [(V(a)) = a for reduced ideals a C K(X], 
which is, so to speak, the essence of Hilbert’s Nullstellensatz 3.9/4, does not 
remain valid. Just consider for K = R and n = 1 the ideal a = (X?+1) C R[X]. 
Then Vg(a) = @ and hence I(Vg(a)) = R[X] # a. Therefore, in the setting of 
Hilbert’s Nullstellensatz, one cannot abandon the condition that the zeros are 
considered in the affine n-space of an algebraically closed field. 


Solutions to Exercises 339 


4.1, Exercise 1. If L/K is a finite Galois extension, then by the fundamen- 
tal theorem of Galois theory 4.1/6, the intermediate fields of L/K correspond 
bijectively to the subgroups of the Galois group Gal(L/K’). This fact was used 
in 4.1/8 to see that a finite separable field extension admits only finitely many 
intermediate fields, a result that does not extend to nonseparable (finite) exten- 
sions. Since the intermediate fields of L/K are characterized as the fixed fields of 
the subgroups of Gal(Z/K’), see 4.1/6 again, such fields can often be explicitly 
computed if the Galois automorphisms and the group structure of Gal(L/K) 
are sufficiently well known. Related to this fact is another aspect of Galois the- 
ory. To specify a finite Galois extension L/K is the same as to give a field L 
together with a finite group G of automorphisms on L. Indeed, from L/K we 
get the Galois group G = Gal(L/K) as a finite group of automorphisms on L, 
and from L together with such a group G we can rediscover K as the fixed field 
L® with respect to G; cf. 4.1/4 and 4.1/6. We will further deepen this point of 
view in Section 4.11, on Galois descent. 


4.1, Exercise 2. Let L/K be a quasi-Galois field extension with automorphism 
group G = Autx(L). Then L/L® is a Galois extension with Galois group G, 
according to 4.1/5 (i). Furthermore, L° = K; is (for char K > 0) the maximal 
purely inseparable extension of AK in L; cf. 3.7/5 and 4.1/5 (iii). Therefore, 
the assertion of 4.1/6 can be generalized by stating that the subgroups of G 
correspond, in the manner of 4.1/6, bijectively to those intermediate fields of 
L/K that contain the maximal purely inseparable extension A; as a subfield. 


4.1, Exercise 3. If L/K is a Galois extension with Gal(Z/K) = Aut, (ZL) as 
Galois group, then we conclude from 4.1/5 (ii) that A is the fixed field of L 
with respect to the automorphism group Aut, (LZ). The converse of this follows 
from 4.1/4. 


4.2, Exercise 1. If L/K is an arbitrary Galois extension, then L can be viewed 
as the union over the system (L;)je, of all intermediate fields of L/K that are 
finite and Galois over K’; see the beginning of Section 4.2. It follows that an 
element a € L, say a € Lj, is invariant under a subgroup H C Gal(L/K) if 
and only if a is invariant under the image H; = f;(H) of H with respect to 
the restriction map f;: Gal(ZL/K) —+ Gal(L;/K). In other words, we have 
LY L; = L#* and hence L” = U,,,; L/". On the other hand, if we consider 
an intermediate field E of L/K, we can look at the subgroup H = Gal(L/E) 
of Gal(L/) corresponding to E. Then H = (),<; f; '(Gal(L;/L;9 E)), as well 
as f;(H) = Gal(L;/L;N E), where the latter equality is obtained with the help 
of the extension argument 3.4/9. In terms of these formulas, the Galois theory 
of L/K is, so to speak, reduced to the Galois theories of the extensions L;/K. 
Making use of the fundamental theorem 4.1/6 for these extensions, we obtain for 
an intermediate field F of L/K with Galois group H = Gal(L/E) immediately 
L? QL, = EN L; and hence L# = E, since f;(H) = Gal(L;/L; E). On the 
other hand, departing from a subgroup H C Gal(L/K) and considering its fixed 
field L”, we get Gal(L/L") = ()j,<; f; ‘(fi(H)), a group that contains H and 
is different from H in general. In this respect the general version 4.2/3 of the 
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fundamental theorem of Galois theory differs from the version 4.1/6 for finite 
Galois extensions. 


4.2, Exercise 2. As we have just explained, the Galois theory of a Galois ex- 
tension L/K is characterized by the Galois theories of the extensions L;/K, 
i € I, where (L;);er is the system of all intermediate fields of L/A that are 
finite and Galois over K. In the light of this fact it is natural to identify a 
Galois automorphism 0: L —+ L with the system of its restrictions (o|,, )jer- 
Implementing this point of view in a consistent way, we arrive at the interpreta- 
tion of Gal(L/K) as a projective limit of the Galois groups Gal(L;/), hence of 
Gal(Z/F) as a profinite group. Thereby Gal(L/K) carries a natural topology, 
namely the one induced from the discrete topologies on the groups Gal(L;/K). 
As we have seen in 4.2/3, resp. 4.2/4, this topology is able to characterize those 
subgroups in Gal(L/) that occur as Galois groups Gal(L/£) for intermediate 
fields E of L/K; indeed, these are precisely the closed subgroups of Gal(L/K). 
On the other hand, if for an infinite Galois extension L/K, the corresponding 
Galois group Gal(L/K) is known merely as an abstract group, without any indi- 
cations regarding the corresponding topology, this is only of minor importance 
for characterizing the Galois theory of L/K. For the study of infinite Galois 
groups Gal(L/Kx), there is the choice of either introducing their topology in a 
direct way, as in 4.2/1, or relying on the formalism of projective limits. Usually 
the latter bears advantages for explicit computations; see, for example, 4.2/11. 


4.3, Exercise 1. Every group G can be viewed as a subgroup of the group 
of its bijective selfmaps G —> G, by identifying an element a € G with its 
corresponding left translation T,: G —> G, g ++ ag. Therefore, to solve our 
problem, it is enough to show that every subgroup G of a permutation group 6G, 
can occur as a Galois group. In fact, this is easy to achieve. Consider the rational 
function field L = k(T,,...,T;,) in n variables T,,...,7;, over a coefficient field 
k. Similarly as we did when considering the generic equation of degree n, we 
can view G as a subgroup of the isomorphism group of L, by interpreting the 
elements of G as permutations of the variables 7),...,7,. Then L/L is a 
Galois extension with Galois group G by 4.1/4. However, much more difficult 
and partially still open is the question whether a given finite group can be 
realized as the Galois group of an extension L/Q. 


4.4, Exercise 1. By its characterization given in 4.4/3, the discriminant A, 
of a monic polynomial f with coefficients in a ring R provides a certain mea- 
sure for the distance between the zeros of f, even if they become available only 
after extending R. To compute Ay, we could extend R until f factorizes com- 
pletely into linear polynomials, and then try to calculate the product over the 
squares of the differences of the zeros of f. However, such a procedure is not 
very practicable. Just think of the difficulties that occur in looking for explicit 
factorizations of polynomials with coefficients in Q,R, or C. Instead, one car- 
ries out a calculation in a “universal” setting and shows that the result can be 
transported via suitable ring homomorphisms to all other situations. Indeed, 
one considers the polynomial f = [[j_,(X — T;) in the variable X over the 
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coefficient ring Z[T),...,T;,]. At this point, the fundamental theorem on sym- 
metric polynomials 4.4/1 enables us to write the discriminant A; as an integer 
polynomial in the elementary symmetric polynomials s1,...,5,, which occur 
as the coefficients of f. The resulting solution for Ay can then be transported 
via ring homomorphisms to any other coefficient domain. In this way, one ob- 
tains a formula for A; that is valid independently of the coefficient rings under 
consideration. 

If we restrict the setting of the fundamental theorem on symmetric poly- 
nomials 4.4/1 to polynomials over a field AK, then the generic polynomial 
f =[[jL,(X — Tj) has to be seen over the coefficient ring K[T),...,T,]. Asa 
consequence, A, is obtained as a polynomial in s1,...,8,, however, now with 
coefficients that are known only to belong to K. This is why for variable charac- 
teristic of the fields under consideration, there is no chance any longer, to relate 
the different formulas for A; to each other. 


4.5, Exercise 1. Let @, = g,...9, be the prime factorization of the cyclotomic 
polynomial ®, € K[X] and observe that the factors gi,..., 9, are distinct, since 
®,, is separable. Since the zeros of the g; coincide with the primitive nth roots 
of unity, we can view every g; as the minimal polynomial over K of a primitive 
nth root of unity. Each of these roots generates the same extension field of K, 
namely A(¢), and we see that deg g; = [K(¢) : K] = s for all 7. Since ®,, is of 
degree y(n), we get r = y(n)/s, as claimed. 


4.5, Exercise 2. For m,n € N — {0}, consider primitive mth and nth roots 
of unity Gn; Gn € Q. Then we get [Q(Gn; Cn) : Q(Gn)] < deg ®, = y(n), since 
¢n is a zero of &,. Furthermore, we see that ®&,, is irreducible over Q(¢,,) if and 
only if [Q(Sm: Cn) : Q(m) ] = y(n), Le., using [Q(Gm) : Q] = y(m), if and only 
if [Q(Gm; Gn) : Q] = v(m) - y(n). To look more closely at the latter relation, 
we compute the degree of Q(¢m,¢n)/Q. Let k = lem(m,n). Then Q(Gm, Cn) 
contains a primitive Ath root of unity ¢ according to 3.6/13, so that we get 
Q(Gm, Gn) = Q(¢) and hence [Q(Gm; Gn): Q] = (k). 

Our considerations show that ®, is irreducible over Q(¢,) if and only if 
the equation y(lem(m,n)) = v(m) - y(n) holds. Now choose decompositions 
m= mgm’ and n = ngn’ such that lem(m,n) = mgng and ged(mo, ng) = 1, as 
in 3.6/13. Then we get 


y(Iem(m, n)) = v(m) - y(n) < v(m) - y(n) 


from 4.5/4, where equality holds if and only if (mo) = y(m) and y(no) = y(n). 
Further, we read from the explicit formula in 4.5/4 (iii) that y(mo) = y(m) is 
equivalent to m’ € {1,2}. The corresponding fact holds for the decomposi- 
tion n = non’, and we see that @, is irreducible over Q(G,,) if and only if 
gcd(m,n) € {1, 2}. 


4.6, Exercise 1. Assume F = F,, where g is a power of a prime number p. 
Then the multiplicative group of F, is cyclic of order q— 1 by 3.8/5. Therefore, 
we have to determine all group homomorphisms G —> Z/(q — 1)Z. Now let 
¢ be a generating element of G, where in a first step, we assume that ¢ is of 
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infinite order. Then we can define for every element a € Z/(q — 1)Z a unique 
group homomorphism G —> Z/(q—1)Z via ¢ +> a. Thus, in this case, there 
are q — 1 characters on G with values in F*. 

It remains to consider a cyclic group G of finite order m > 0. Indeed, if 
G — Z/(q—1)Z is a homomorphism with image a of ¢, we get m-a = 0, and 
the order of a divides m. On the other hand, we can define for each element 
a € Z/(q —1)Z whose order divides m a homomorphism G —> Z/(q — 1)Z via 
¢ ++ a. Therefore, the desired homomorphisms G —> Z/(q — 1)Z correspond 
bijectively to the elements in Z/(q—1)Z whose order divides m. An elementary 
calculation shows that their number equals gcd(m, q — 1). 


4.7, Exercise 1. Viewing L as a K-vector space, the map trz/~K: L — K is 
a linear functional on L, and its kernel ker trzj~ = {a € L; trz/x(a) = O} isa 
k-subvector space of L. If L/K is separable, then trz/q is nonzero, and hence 
ker tr/« is an (n— 1)-dimensional A’-subvector space of L. On the other hand, 
if L/K is not separable, then trz/K is the zero map, and we have ker trz/x« = L. 


4.7, Exercise 2. Let F = F,, F’ = Fy for prime powers q and q’ satisfying 
qd = q", where n is the degree of the extension F’/F. First we want to show that 
the norm map N: F’/* —+ F* is surjective. Taking into account the fact that the 
Galois group Gal(F’/F) is generated by the relative Frobenius homomorphism 
a+t—> a’, the norm of an element a € F’ is calculated as 


2 n—1 gal 
N(a) =a-a‘-a’-...:a% =aer, 


In particular, we see that N(a)?~? = a?"~! = 1. Now we use that the group F”™ is 


cyclic, generated by some element a of order g”—1. Then N(a@) = <—s € F is of 
order q—1 and hence is a generating element of the cyclic group F*. Therefore, 
since N: F’* —+ F* is a group homomorphism, it is surjective. Furthermore, 
we see that the kernel of N consists of all elements a” such that (q—1)|r, or in 
other words, of all elements that are a (q¢— 1)th power of an element in F’*. 


4.8, Exercise 1. Let L/K be a finite cyclic Galois extension with a generating 
element o € Gal(L/K). For b € L* consider elements a,a’ € L* such that 
b = ao(a)~! = a’o(a’)~!. Then we get o(a/a’) = a/a’ and hence a/a’ € K*. 
Conversely, we see from a/a’ € K* that ao(a)~! = a’o(a’)~!. As a conclusion, 
if b € L™ satisfies Nz/x(b) = 1, then the element a € L*, which satisfies 
b = ao(a)~! according to 4.8/1, is unique up to a multiplicative constant in K*. 
In the same way one shows in the setting of 4.8/4 for every b € L with trace 
trz/x(b) = 0 that the corresponding element a € L that satisfies b = a — o(a) 
is unique up to an additive constant in K. 


4.8, Exercise 2. The Galois group Gal(C/R) is cyclic of order 2, generated by 
the complex conjugation C —> C, z+» Z. Thus, we get Ncyr(z) = 22 = |z|? 
for z € C. Now assume Ne;r(z) = 1, ie., that z belongs to the unit circle 
centered at 0. Then Hilbert’s Theorem 90 asserts the existence of an element 
x € C%* satisfying z = x/Z, where we may even assume xv = |x|? = 1. This 
implies z = x”, i.e., that x is a square root of z. 
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4.9, Exercise 1. We start with a cyclic extension L/K of degree n. Writing 
C= LL" K*, we have L = K(C¥") by 4.9/3 and n = [L: K] =(C: K*") by 
4.9/1. The Galois group Go = Gal(L/K) is cyclic of order n by our assumption. 
Hence, the same is true for Hom(C/K*", U,,) by 4.9/3 and for C/K*" by 4.9/2. 
Now choose an element c € C whose residue class generates the group C/K*". 
Then L = K(c'/), and we see that the extension L/K is obtained by adjoining 
a zero a of the polynomial X" —c € K[X] to K. By reasons of degree, this 
polynomial is irreducible and therefore coincides with the minimal polynomial 
of a over K. 

Conversely, consider an extension L/K that is obtained by adjoining a zero 
a of a polynomial of type X" — c to K, where we may assume c € K™. Let 
C be the subgroup that is generated in K* by c and K*". Then we can write 
L = K(C'"). Furthermore, we conclude from 4.9/3 that L/K is an abelian 
extension with Galois group Hom(C/K*",U,,), resp. C/K*", since the latter 
group is finite. Now observe that C/K*" is generated by the residue class of ¢ 
and hence that L/K is cyclic. Since c” € K*", the group C/K*" is cyclic of 
some order d dividing n. In particular, we get c! € K*” and hence a’ € K. 
Then we can see, similarly as before, that X? — a% is the minimal polynomial 
of a over K. 


4.9, Exercise 2. The group C = K™ is the largest of all subgroups in K* 
containing K*". Therefore, it follows from 4.9/3 that likewise, L, = K(K*") 
is the largest abelian extension of AK with an exponent dividing n. Since ev- 
ery group homomorphism AK* —+ U,, is necessarily trivial on K*", we get 
Gal(L,/K) = Hom(K*,U,,), again by 4.9/3. 


4.10, Exercise 1. Let us consider the setting of Theorem 4.10/1. We claim 
that a field extension L/K is cyclic of a degree dividing n if and only if there 
exists an element a € A such that g(a) € Ax and L = K(a). The argument 
is the same as the one used in Exercise 1 of Section 4.9. First, assume that 
L/K is a cyclic Galois extension of some degree dividing n. Then, according 
to 4.10/1, we have L = K(g~!(C)) for C = g(Azr) NM Ax, and there is an 
isomorphism C/p(Ax) > Hom(Go, Un), where Go is the Galois group of 
L/K. By our assumption, Gc is cyclic of some degree dividing n. The same is 
true for C/p(Ax), due to 4.9/2, and there is an element c € C whose residue 
class generates C/g(Ax). Choosing a preimage a € gy l(c), it follows that 
g'(C) is generated by a and Ax, and we get L = K(a), as claimed. 

On the other hand, assume L = K(a) for some element a € A satisfying 
g(a) € Ax. Then we get L = K(o7'(C)), where C is generated by o(a) and 
(Ax). In particular, C/e(Ax) is cyclic, generated by the residue class of (a), 
and we see from 4.10/1 that L/K is an abelian extension of some exponent 
dividing n. Using 4.9/2, it follows that L/K is even cyclic. 


4.10, Exercise 2. Since K is a perfect field of characteristic p > 0, the Frobe- 
nius homomorphism kK —+ K is an isomorphism, and the same is true for the 
Frobenius operator F': W(A’) —+ W(K). Therefore, the relation V o F = p of 
4.10/7 implies p- W(K) = V'W(K). 
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Concerning assertion (i), recall the formula 


(a, 0,0,...)« (8,0, 0,...) = (a+ B,0,0,...) 


for the multiplication in W(4’), which was mentioned in Section 4.10. It implies 
that the map K —> W(K), a +> (a,0,0,...), is multiplicative and hence 
restricts to a monomorphism of multiplicative groups h* —>+ W(K)*. On the 
other hand, there cannot exist a nontrivial map K —+ W(K) that is additive. 
Indeed, multiplication by p yields the zero map on K, while up to the Frobenius 
operator, it equals the Verschiebung operator on W(‘). 

Next we turn to assertion (ii). We have to show that W(A), together with 
the projections W(K) —> W(K)/V"W (Kx), constitutes a projective limit of 
the projective system 


W(K)/V°W(K) — W(K)/V'W(K) — W(K)/V?W(K) HH... . 


To justify this we establish the defining universal property of Section 4.2. In- 
deed, consider a ring R, together with a system (h,)nen of ring homomorphisms 
hn: R— W(K)/V"W (Kx) that are compatible with all projections 


W(K)/Vi4W(K) 4 W(K)/ViW(K), i EN. 


Then the maps h, admit a common unique factorization through W (Iv), via 
the map 


h: R—>W(K), 2-4 (ha(a)o, ho(x)1,hs(x)2,-..), 


where hy+1(2)n is in each case the component of hnii(x) € W(K)/V""!W(K) 
of index n. That h is even a ring homomorphism follows by a formal argument 
on projective limits, or by explicitly using the definition of the ring structure 
on W(k) in terms of the polynomials S,,, P,,. Therefore, the first part of asser- 
tion (ii) is clear, while the second one, namely that W(F,) coincides with Z,, is 
easily derived from 4.10/10. 

To verify (iii) look at the canonical projection W(t) —> W,(K) = K. It 
is an epimorphism with kernel V'W(K) = p- W(K), and we thereby see that 
p- W(X) is a maximal ideal in W(A). Furthermore, we claim that this ideal 
is the only maximal ideal in W(J), in other words, that the group of units 
W(K)* coincides with W(K) — V'W(K). To justify this, consider an element 
a € W(K) —V'W(K). In order to show that it is a unit, we can multiply a by 
a unit of type (a,0,0,...) for some a € K™; cf. (i). In this way, we can assume 
that a is of type 1 — p-c for some element c € W(K). Now, using the relation 
p’- W(K) = V'W(K), it is easy to see that b = Yo,-yp'- c gives rise to a 
well-defined element in W(K). Indeed, the image of each finite sum )>}_, p'- 
with respect to the projection W(K) —> W(k)/V"W (kx) is independent of s 
for s > n. Furthermore, the formula for the geometric series shows that a - 6 is 
mapped to the unit element under all projections W(K) — W(K)/V"W (KR), 
in other words, that the relation a-b = 1 holds in W(K). 
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Therefore, we have recognized W(K) — p- W(K) = W(K) — V'W(K) as 
the group of units in W(/’). Next observe for every nonzero element a € W(K) 
that there is a unique integer n € N satisfying a © V"?W(K) — V""!W(K). In 
this way, we can write a = p"-a’ for some element a’ € W(K) — V'W(K) and 
hence for some unit a’ € W(K)*. Since p?- W(K) = V"(K), it follows that p 
is not nilpotent and consequently that W (Jc) is an integral domain. Moreover, 
every nontrivial ideal a C W(Ix) satisfies 


a=(p"), where n=min{iEN; p' € al}. 


Thus, W (Jc) is a principal ideal domain. Let us add that principal ideal domains 
admitting precisely one nontrivial maximal ideal are referred to as discrete val- 
uation rings. In particular, W (Jc) is such a discrete valuation ring. 


4.11, Exercise 1. Choose a K-vector space basis (a;);er of A. Then (a; @ 1)ier 
is a K'-vector space basis of A @x K', and every element of A @x K' can be 
written as asum >7,-, a; ® G with unique coefficients c; € K’, where c; = 0 for 
almost all indices 2 € I. Now, to introduce the multiplication by some element 
Due 14; ®c; on A®x K’, consider in a first step the (right) multiplication by 
a term a; @ c}: 


Ya; c,: A@x K' — A®x K’, Sai @ 4 4 Saja; ® cic}. 
i€l i€l 
Then the multiplication by 5> der 


Pa;,c': In this way, we get a map 


a; ®@ cj can be defined as the sum of the maps 


(A @x K’) x (A @x kK’) — > A@K Kk’ 


that is characterized by the mapping rule (a ® c,a’ @ ¢) +> aa’ @ cc’. Using 
this rule, the properties of a ring multiplication can be read directly from the 
corresponding properties of A and A’. Furthermore, A @x K’ is a K'-algebra 
via the ring homomorphism A’ —> A @x K',c-> 1c. 


4.11, Exercise 2. To prove assertion 4.11/4 (i), it is enough to show for every 
finite-dimensional AK-subspace Vo C V that the inclusion map \: V > V’ gives 
rise to a A’-linear map Aj: K’ @«% Vo —> V’ that is injective. This can easily 
be justified by induction on r = dim, Vo. Indeed, nothing has to be shown for 
r = 0. Therefore, let r > 0. Then there is a nonzero vector x € Vo, and we 
can view the K-vector space Vo/Ka as a part of the fixed set corresponding 
to the action that is induced by f on V’/K'x. The canonical K’-linear map 
K' @K (Vo/Ka) — V'/K'x is injective by the induction hypothesis, and an 
easy calculation shows that then Aj: K’ @x Vo —> V’ is injective as well. 
To verify 4.11/4 (ii), observe that f,(a;v) = o(a;)f.(v) and hence that 


Y Hew) = ola), 1a 
o€G oc6G 


Since the matrix (0(a;))¢eGici..n € (K')”*” is invertible by 4.6/3, the elements 
fo(v), o € G, and in particular v, can be written as K’-linear combinations 
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of the elements 45 = dojcq fo(aiv), i = 1,...,n. All vectors v; are fixed by 
the action of G on V’ and therefore belong to V. However, this implies that 
NV: K' @x V — V' is surjective. 

5.1, Exercise 1. The H-orbit of an element g € G under left translation by H, 
i.e., under the action H x G —> G, (h,g) +> hg, equals the right coset Hg. 
If {g1,..-,9,} is a system of representatives of the right cosets of G modulo 
H, then the orbit equation reads ordG = )>\_, ord(Hq;). Furthermore, the 
number r of right cosets of H coincides with the index (G : H), and all right 
cosets Hg; contain precisely ord H elements. Therefore, the orbit equation yields 
ordG = (G: H)-ord H, which coincides with the formula given by the theorem 
of Lagrange 1.2/3. If we consider right translation instead of left translation 
by H, more precisely, the action H x G —> G, (h,g) +> gh, then the 
corresponding H-orbits are of type gH and hence coincide with the left cosets 
of H. Also in this case, the corresponding orbit equation yields the formula of 
1.2/3. 


5.1, Exercise 2. A Galois automorphism o € Gal(L/K’) leaves a given ele- 
ment a € L fixed if and only if it leaves the field K(a) fixed. Therefore, the 
stabilizer group of a satisfies G, = Gal(L/K(a)). Next, the orbit Ga consists 
of all elements that are conjugate to a over K in the sense of Galois theory; cf. 
4.1. For example, if f € K[X] is the minimal polynomial of a over K’, these 
are precisely the zeros of f. Indeed, every o € Gal(L/K’) maps the zero set of 
f into itself. On the other hand, using the normality of L/K, the polynomial 
f factors in L[X] completely into linear polynomials (cf. 3.5/4 and 3.5/5). In 
addition, given a zero a’ € L of f, there is an automorphism o € Gal(L/K) 
such that o(a) = a’ (cf. 3.4/8 and 3.4/9). In particular, since L/K is separable, 
we get ord Ga = degf = [K (a): K], as well as ord G, = [L: K(a)]. 


5.2, Exercise 1. Let G be a finite abelian group and p a prime number. Then 
there exists a p-Sylow group S C G by Theorem 5.2/6 (i). Since all p-Sylow 
groups in G are conjugate due to 5.2/6 (ii), and since G is abelian, it follows 
that S is the only p-Sylow group in G. Thus, applying 5.2/6 (i) again, S' is as 
described in 5.2/2. In this way, Theorem 5.2/6 shows that the elements of G, 
whose orders are p-powers, form a p-Sylow group in G, a fact that was proved 
in 5.2/2 by elementary means. 


5.2, Exercise 2. If S C G is a p-Sylow group, its image y(S) consists of 
elements whose orders are powers of p. Hence, y(S) is a p-group by 5.2/11, and 
we conclude from 5.2/6 that there exists a p-Sylow group 9’ in G’ such that 
y(S) CS". If y is injective, say yp: G > G’, we get necessarily S’NG = S, since 
S’NG is a p-group in G containing S. In other words, if G C G’ is a subgroup, 
the p-Sylow groups of G are restrictions of (certain) p-Sylow groups in G’. On 
the other hand, if G is a normal subgroup of G’, then every p-Sylow group 
S’ c G’ restricts to a p-Sylow group $’MG in G. Indeed, $"M G is a p-group 
and thereby contained in a p-Sylow subgroup S of G. Then S is a p-group in 
G", and there is an element g € G’ such that gSg~! C S’; cf. 5.2/9. Since G 
is normal in G’, we get gSg~' C G, and it follows that gSg7' is a p-Sylow 
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group in G, since ord S = ord gSg~t. However, since gSg~' is contained in the 
intersection S’M G and since this is a p-group, we get S’NG = gSqg7t, ice., 
S'G is a p-Sylow group in G. 

Furthermore, let us look at the case that y: G —> G’ is surjective. Fixing a 
p-Sylow group S C G, we claim that its image H’ = y(S) is a p-Sylow group in 
G’. To justify this, consider the left translation of G as an action on the set of 
left cosets G’/H'. This action is transitive, i-e., admits only a single orbit. If H 
is the stabilizer group of H’ viewed as an element in G’/H’, we get S C H, as 
well as ord G/H = ord G’/H' by the orbit equation. Then pford(G/S) implies 
p{ord(G/H), and hence p{ord(G'/H’). In particular, since H’ is a p-group, 
it is already a p-Sylow group in G’. Thus, the image of every p-Sylow group 
S CG is a p-Sylow group in G’. On the other hand, it is easy to see, using the 
conjugation action, that every p-Sylow group in G’ is the image of a p-Sylow 
group in G. 


5.3, Exercise 1. A permutation 7 € G,, is a bijective self-map on the set 
{1,...,n}. In other words, 7 “permutes” the numbers 1,...,7, i.e., it changes 
their order to 7(1),...,7(n). For example, if 7 is a transposition, it exchanges 
precisely two elements of the sequence 1,...,n. In this way, it is plausible that 
the numbers of the sequence 1,...,n can be put into any desired order by a 
repeated process of exchanging only two elements at each step. In fact, this 
corresponds to the assertion that every 7 € G, can be written as a product of 
transpositions. 

To give a strict proof, use induction on n, where the base case n = 1 is 
trivial, since G; consists only of the unit element, which can be interpreted 
as the empty product. Therefore, assume n > 1. If there exists an index 
i€ {1,...,n} such that (i) = 7, then 7 restricts to a bijective selfmap 7’ on 
{1,...,¢—1,i+1,...,n}. Relying on the induction hypothesis, z’ is a product 
of transpositions, and the same is true for 7. On the other hand, if there is an 
index i € {1,...,n} such that (i) ¥ i, then (7, 7(7)) o 7 leaves the element 7 
fixed and hence is a product of transpositions, say (i, 7(i))o7 = 7,0...07p, as 
seen before. But this implies 7 = (i,7(7)) 07, 0...07, and hence that 7 is a 
product of transpositions. 


5.3, Exercise 2. Let 7 € G, be a p-cycle, say 7 = (1,...,p). Then the cyclic 
group (7) generated by 7 is a p-Sylow group in G,. Indeed, its order is p, and 
we have p{(G, : (m)) as (G,: (7)) = (p—I)!. 


5.4, Exercise 1. Assume first that H is a normal subgroup of G. Then the 
relation g[a,b]g~'! = [gag~', gbg~*], see the proof of 5.4/1, shows that [G, H] 
is a normal subgroup in G. We claim that [G, H] is the smallest of all normal 
subgroups N C G such that the image of H in G/N is contained in the center 
of G/N. Indeed, the image of H in G/[G,H] commutes elementwise with all 
residue classes of elements g € G. Hence, the image of H is contained in the 
center of G/[G, H]. Conversely, if N C G is a normal subgroup satisfying this 
condition, then all commutators [a,b] for a € G and b € H belong to N, so 
that we have [G, H] C N. Therefore, our claim is justified. On the other hand, 
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if H is only a subgroup in G, then the above argument still yields the following 
assertion: If N C G is a normal subgroup such that the image of H is contained 
in the center of G/N, then [G, H] CN. 


6.1, Exercise 1. If f(x) = 0 is solvable over K, it will not necessarily be 
solvable over Ko as well. For example, there exist algebraic equations f(x) = 0 
over Q that are not solvable, as we saw at the end of Section 6.1. Clearly, 
such an equation will become solvable over a splitting field of f, or over an 
algebraic closure of Q. On the other hand, if f(a) = 0 is solvable over Ko, it 
is solvable over K as well. Indeed, if ZL is a splitting field of f over K, and 
[Io C L a splitting field of f over Ko, then there is a canonical restriction 
map Gal(L/K) —> Gal(Lo/Ko) by 3.5/4. This map is injective, since both 
extensions L/K and Lo/Ko are generated by the zeros of f. Thus, if Gal(Lo/ Ko) 
is solvable, then the same is true for Gal(L/K) by 5.4/8. 


6.1, Exercise 2. Assume first that the equation f(x) = 0 is metacyclic. Then 
there exists a chain of fields K = Ky C Ky, Cc... C K, such that the splitting 
field L of f over K is contained in K,, and Kj+,/K; is in each case a (finite) 
cyclic, and therefore solvable, Galois extension. Using 6.1/4, this implies that 
the extensions K,/K and L/K are solvable as well. 

Conversely, consider the case that f(x) = 0 is solvable. Then the Galois 
group Gal(L/K) is solvable, and we can conclude from 5.4/7 that Gal(L/K) 
admits a normal series with cyclic factors. Thus, the fundamental theorem of 
Galois theory 4.1/6 shows in conjunction with the primitive element theorem 
3.6/12 that the equation f(x) = 0 is metacyclic. Thereby we see that “meta- 
cyclic” is equivalent to “solvable.” The remaining equivalence to “solvable by 
radicals” follows from 6.1/5. 


6.2, Exercise 1. According to the theory developed in Section 6.2, consider 
the chain of field extensions 


KCK(VA)CI'cL, 


where L’ is a splitting field of g and L a splitting field of f over K, and where 
A is the common discriminant of f and g. The Galois group G = Gal(L/K) 
acts on the zeros 21,...,2%4 € L of f and thereby can be viewed as a subgroup 
of Gy. As we know already, A admits a square root in K if and only if G acts 
in terms of even permutations on the «;, in other words, if and only if G C 4. 
Therefore, the degree of K(VA) over K equals 1 or 2, depending on whether 
we have G C 4 or G Z Ay. 

Since f does not have real zeros, the complex conjugation C —> C, z +> 2, 
restricts on L to a nontrivial element of the Galois group G. In particular, the 
zeros of f consist of two pairs of conjugate complex numbers, say where x2 = 21 
and #4 = %3. Furthermore, we conclude that the zeros 


m4 =(%14+%2)(%3 +04), 2 = (1+ %3)(t2+ 24), 23 = (%1 + 24)(L2 + 23) 


of g satisfy z; € R, as well as 22,23 > 0, where z; # 0 for all 7, since g is 
irreducible. In particular, we get L' C R. Furthermore, the degree of L/K is 
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divisible by 3, using the irreducibility of g again, and it follows that L’/K(VA) 
is of degree 3. 

Now we claim that the Galois group H = Gal(L/K(vA)) coincides with 
Q,. Of course, we have H C 4. Furthermore, observe that complex conjugation 
leaves L’ C R fixed and therefore gives rise to a nontrivial element in Gal(L/L’). 
In particular, the degree [L : L’] is divisible by 2 and hence is at least 2. It 
follows that the order ord H = [L : K(VA)] is at least 6, hence 6 or 12, and it 
is enough to exclude the case ord H = 6. To do this, assume ord H = 6. Then 
there exists in H precisely one 3-Sylow group by the Sylow theorems 5.2/6. In 
addition, the chain K(WA) C L' C L, where necessarily [L : L'] = 2, shows 
that H contains a normal subgroup of order 2, which is a 2-Sylow group in H 
and, being normal, the only 2-Sylow group in H. But then the proof of 5.2/12 
shows that H is cyclic of order 6, in contradiction to the fact that G4 contains 
only elements of order 1, 2, 3, or 4. Thus, ord H = 12, and we can conclude that 
Gal(L/K(vA)) = 24, as claimed. To sum up, we get Gal(L/K) = % if A isa 
square in K, as well as Gal(L/K) 2 24, and therefore Gal(L/K) = Gy, if A is 
not a square in Kk. 

Let us add an example illustrating our discussion. Consider the algebraic 
equation f(x) = 0, where 


f=X*4+X?4+X41eQ[X]. 


Clearly, f does not admit real zeros and is irreducible. The resolvent cubic is 
given by 
g = X?-2X?- 3X +1€ Q[X] 


and is irreducible as well. Furthermore, the discriminant of f and g is 


A = 144 — 128 — 44 16 — 27 + 256 = 257. 


Since 257 is not a square in Q, we see that the Galois group of the equation 
f(x) = 0 equals 64. 


6.3, Exercise 1. The properties of real numbers used in the proof of 6.3/1 
cannot be justified in terms of purely algebraic methods, for instance, as de- 
veloped in the present book. This comes as no surprise, since we have assumed 
the real numbers to be “known,” without giving a precise characterization of 
them. Anyway, the study of real numbers and of real-valued functions is rather 
a part of analysis than of algebra. Accordingly, we justify the desired properties 
by means of infinitesimal calculus. Therefore, let f = X" +a,X"14+...+4ay 
be a polynomial of odd degree in RLX]. Using the factorization 


fiz) sa"(l4+aqac'+...+a,2) 
for x € R, x £0, we conclude that 


lim f(x) = oo, lim f(x) = —oo. 


«r—-0o 
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Then the intermediate value theorem asserts that f(x), as a real-valued contin- 
uous function, admits a zero in R. For a similar reason, every a € R, a > 0, 
admits a square root in R. Indeed, consider the function g(x) = x” — a. By the 
intermediate value theorem again, it admits a zero in R, since g(0) < 0, as well 
as liMy-+o0 g(@) = Ov. 


6.4, Exercise 1. Write K = Q(M U M). Looking at a point z € &(M), we 
conclude from 6.4/1 in conjunction with the multiplicativity formula 3.2/2 that 
the degree [K(z) : K] is a power of 2. Conversely, let z € C be an element 
satisfying such a condition. Applying 6.4/1, we get z € (MM), provided the 
extension A’(z)/K is Galois. However, in general we obtain z € R(M) only if z 
is contained in a Galois extension of K whose degree is a power of 2. Writing 
L for the field that is generated over K by all conjugates of z, i.e., for the 
splitting field of the minimal polynomial of z over K’, the preceding condition 
is equivalent to the condition that the degree [L : kK] is a power of 2. However, 
there are examples for which [A(z) : K] is a power of 2 but [L : K] is not. Just 
observe that there exist irreducible algebraic equations of degree 4 with Galois 
group G4, as we will see below. In particular, we cannot necessarily conclude 
that z € RUM) from the fact that [A (z) : A] is a power of 2. 

To explicitly construct such an example, let M = {0,1} and consider a 
polynomial of type f = X*— pX —1€ Q[X] for a prime number p. Then f is 
irreducible. Indeed, it is enough to show that f is irreducible as a polynomial 
in Z[X]; see 2.7/7. This is easily checked by showing that f does not admit a 
decomposition over Z into a linear and a cubic polynomial, or into two quadratic 
polynomials. Now let a1,...,a4 € C be the zeros of f, and let L = Q(ay,..., a4) 
be the splitting field of f in C. Then the solution process of quartic equations 
in Section 6.1 shows that the quantities 


Bi = (a1 + a2)(a3 +04), Bo = (a1 t+as)(a2+a4), 83 = (a1 + a4)(a2 + a3) 


are the zeros of the resolvent cubic of f, which is given by the polynomial 
g = X?+4X +p’. Similarly as for f, one checks that g is irreducible over Q. 
Therefore, L contains elements of degree 3 over Q. Hence, the degree [L : Q] 
cannot be a power of 2. In particular, aj,...,a4 do not belong to R({0, 1}), 
although each a; is of degree 4 over Q. In addition, it is easily seen that the 
Galois group Gal(L/Q) coincides with the full permutation group G4 when the 
elements o € Gal(L/Q) are viewed as permutations of the zeros a1,...,a4. In- 
deed, since the Galois group Gal(L/Q) is a subgroup of Gy, its order divides 24. 
Furthermore, since LZ contains elements of degree 3 as well as of degree 4 over Q, 
the order is at least 12. Therefore, we can conclude either that Gal(L/Q) = G,, 
or that Gal(Z/Q) is a subgroup of index 2 and hence a normal subgroup in 
G,. In the latter case, we have Gal(L/Q) = 24, since each normal subgroup of 
index 2 in Gy gives rise to an abelian factor group, and since [G4, G4] = 24; 
cf. 5.4/1 and 5.4/2. Now observe that the discriminant 


Ag = (81 — f2)*(B1 — B3)?(B2 — 83)” = —4- 4° — 27p* 
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of the resolvent cubic g does not admit a square root in Q; concerning the for- 
mula for A,, consult example (2) in 4.3 or the end of Section 4.4. Therefore, not 
all elements of Gal(Z/Q) will restrict to an even permutation on the (1, 62, 33, 
and it follows then from the definition of the 6; that Gal(Z/Q) cannot consist 
of even permutations of the a1, a2, a3, a4 only. This implies Gal(L/Q) = Gz, as 
claimed. 


6.4, Exercise 2. As we know, the primitive third root of unity ¢; = e?7/3 € C 
belongs to R({0, 1}); cf. 6.4/3. Now if angle trisection were generally possible 
in terms of compass and straightedge constructions, the primitive ninth root 
of unity Gy = e2”/° could be obtained via such constructions. However, such 
is not the case by 6.4/3, since y(9) = 6. Therefore, in general, the problem of 
angle trisection does not admit a solution in terms of compass and straightedge 
constructions. This is no surprise, since trisecting an angle y corresponds to 
solving the equation z? — e’? = 0, or if we restrict ourselves to the real part of 
this equation and use zZ = 1, to solving the equation 42° — 3x — cosy = 0. 
In general, such cubic equations cannot be accessed in terms of compass and 
straightedge constructions. 


7.1, Exercise 1. Let L/K bea field extension and X¥ = (2;);er a transcendence 
basis. Then the system X is algebraically independent over kK. Furthermore, 
we can view X as a system of variables, and the subring A[X] C L asa 
polynomial ring in the variables x;. Clearly, this implies that the system X is 
linearly independent over K if we view L as a K-vector space. But on the other 
hand, it is impossible that ¥ generates A[X], or even L, as a K-vector space. 
It is for this reason that a transcendence basis of L/K will never give rise to a 
K-vector space basis of L. 

Nevertheless, there is a strong conceptual analogy between bases of vector 
spaces and transcendence bases of field extensions. Under this analogy, “linear 
independence” of a system X of elements of a K-vector space V corresponds to 
“algebraic independence” of a system X of elements of a field extension L/K. A 
basis of V is a linearly independent system X C V generating V as a K-vector 
space. Likewise, a transcendence basis of L/K is an algebraically independent 
system X C L, “generating” L/K in the sense that L/A(X) is algebraic. Just 
as for vector space bases, transcendence bases can be characterized as maximal 
algebraically independent systems (cf. 7.1/3), or alternatively, as minimal “gen- 
erating systems” in the sense just alluded to. Also the proof of 7.1/5, that every 
two transcendence bases of L/K are of the same cardinality, remains valid for 
vector space bases. 

But let us add that there are natural limits to the analogy between such 
bases. For example, every bijection X —> ) between two bases of a vector space 
V gives rise to a unique K-automorphism of V. The corresponding assertion 
for transcendence bases of a field extension L/K fails to be true, with respect 
to both the existence and the uniqueness assertions. Just consider a simple 
purely transcendental extension L = K(X) of a field K. Each of the elements 
X and X? defines a transcendence basis of L/K, and there is a K-isomorphism 
K(X) —> K(X?) mapping X to X*. But this isomorphism does not extend to 
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a K-automorphism of K(X), since X does not admit a square root in K(X). 
On the other hand, if L is an algebraic closure of K(X), the identity map on 
K(X) extends to a K-automorphism on L. But it is not unique, since there are 
nontrivial K(X )-automorphisms of L. 


7.1, Exercise 2. First let us show that C admits automorphisms that do 
not leave R fixed. To do this, choose an element « € R, say « = 7, that 
is transcendental over Q. Following 7.1/4, the extension C/Q admits a tran- 
scendence basis X such that « € X. Since the element ix € C is transcen- 
dental over Q as well, there exists a transcendence basis 2) of C/Q such that 
ix € Y. Then ¥ and Y) are of same cardinality by 7.1/5, and there is a bi- 
jection X —+ Q), where we may assume x +—> ix. This bijection extends to a 
Qisomorphism Q(X) > Q(Q)). Since C is an algebraically closed field that is 
algebraic over Q(X) and over Q(2)), we can view C as an algebraic closure of 
both Q(X) and Q(Q)). Therefore, 


a: Q(X) oC, T: Q(X) ~ QY) oC 


are two algebraic closures of Q(X). Then, by 3.4/10, there exists an automor- 
phism y: C —> C satisfying rT = yoo. Since we have y(x) = ix by construction, 
it follows that y is an automorphism of C that does not leave R fixed, which 
had to be shown. In particular, y(R) is a subfield of C that is isomorphic to R, 
but different from R. 

We proceed in a similar way to show that C admits proper subfields iso- 
morphic to itself. Choosing a transcendence basis X of C/Q, we use the fact 
that X consists of infinitely many elements; cf. Exercise 3 of 7.1. Then there 
exists an injective map X ~ X that is not surjective. To justify this, one may 
use, as shown in 7.1/7, that X is a disjoint union of countably infinite sub- 
sets of X. Then the considered injection  —+ X extends to a monomorphism 
L: Q(X) — Q(X) such that Q(X) is not algebraic over the image of 1. Again we 
consider the two homomorphisms 


oc: Q*OC, 7:QAX0 6QX OC. 


Then C is an algebraic closure of Q(X) with respect to the injection o, but 
not with respect to 7. Using 3.4/9, there is a Q(X)-homomorphism y: C — C 
satisfying T = yoo. Since C is not algebraic over the image of 7, we can exclude 
that y is surjective. Therefore, y(C) is a proper subfield of C that is isomorphic 
to C. 


7.2, Exercise 1. Let ®: M —> E bean R-linear map to an R’-module EF. We 
have only to show that @ gives rise to a unique R’-linear map y: M®zR' —> E 
such that 7 @ 1+ &(x) for x € M. To justify the existence of y, consider the 
R-bilinear map M x R' —>+ E, (x,a) +> a(x). By the universal property of 
tensor products, it induces an R-linear map y: M @z R' —> E that is uniquely 
characterized by y(% ® a) = a®(x) for a € R’ and x € M. In particular, it 
follows that vy, as a map between R’-modules, is even R’-linear. On the other 
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hand, if w: M @p R' —> E is another R’-linear map satisfying ¢)(2 @ 1) = &(x) 
for 2 € M, then ~ coincides on all tensors of type x ® 1 with wy. However, this 
implies y = w, since these tensors generate M/ @®p R’ as an R’-module. 


7.2, Exercise 2. First we look at the case of free polynomial rings R’ = R[X] 
and R” = R[Q)], with systems of variables X,2). Then the polynomial ring 
R[X,Y], together with the canonical injections o’: R[X] —> R[X,Q] and 
o”: R[Y| —> R[X,Y], admits the universal property of 7.2/9. Indeed, a ho- 
momorphism of R-algebras R[X¥,2)] —> A is uniquely characterized by the 
images of the members of X and Q), while such images can be chosen arbitrar- 
ily. In the general case, R’ and R” can be interpreted as residue class rings of free 
polynomial rings, say R’ = R[X]/a and R” = R[Q)]/b. Then R[X, 2] /(a, 6), 
together with the canonical maps o/: R[X]/a —> R[X,Q]/(a,b), as well as 
o”: R[Y|/b — R[X,Y]/(a, 6), admits the universal property of 7.2/9. Indeed, 
ify’: R[X] —> Aand y”: R[Y] —> Aare two R-algebra homomorphisms sat- 
isfying a C ker y’ and b C kery”, then the resulting R-algebra homomorphism 
p: R[X,Y] — A satisfies (a, b) C ker y. 


7.3, Exercise 1. In many cases the question has a negative answer. Consider as 
an example of a regular extension a purely transcendental extension K(X )/K 
for a variable X. If char K = 2, the extension K(X)/K(X7?) is purely insepa- 
rable and therefore not separable, since it is nontrivial. On the other hand, if 
char K 4 2, this extension is separable algebraic, but not primary. 


7.3, Exercise 2. Also this question has a negative answer. To give an example, 
choose a field k of characteristic p > 0 and consider for variables X,Y, Z the 
purely transcendental extension k(X,Y,Z), as well as the following subfields: 


K =k(X?,Y”),  L=k(X?,Y?,Z)(t), where t=X+YZ. 


We want to show that the extension L/K is not separable, although K is alge- 
braically closed in L. First observe that the extension L// decomposes into the 
purely transcendental extension K(Z)/K and the purely inseparable extension 
L/K(Z) of degree p; note that t? — (X? + Y?Z?) = 0 is the irreducible equa- 
tion of t over K(Z). To see that L/K is not separable, consider the elements 
t?, 1”, Z”. These are linearly dependent over K’, as follows from the preceding 
equation. Now, if the extension L/K were separable, then by 7.3/7 (iv), also 
the elements t, 1, Z would be linearly dependent over kK. This implies t € K(Z), 
which, however, is not the case. Consequently, L/K cannot be separable. 
Thus, it remains to show that K is algebraically closed in L. Therefore, 
look at an element a € L that is algebraic over K. Then a? € K(Z). However, 
since every element in K(Z)—K is transcendental over K (cf. 7.1/10), we must 
have a? € K, so that a € k(X,Y). Assume that a does not belong to A’. Then 
a ¢ K(Z), which implies K(Z)(a) = L, since [L : K(Z)] = p. Now observe 
that K(Z)(a) = K(a)(Z). Thus, to obtain Z we may just as well first adjoin the 
algebraic element a to K and then the transcendental element Z. It follows that 
the element t = X¥ + YZ € L = K(a)(Z) can be written as a quotient of two 
polynomials in K(a)[Z] C k(X,Y)[Z], say X +YZ = f(Z)g(Z)~*. Canceling 
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powers of Z on the right-hand side we may assume g(0) 4 0, which implies 
X = f(0)g(0)-1 € K(a). But then, just as t belongs to K(a)(Z), the same is 
true for YZ. Therefore, Y belongs to A(a). This shows that A(a) = k(X,Y), 
which, however, is impossible, since a is only of degree p over K = k(X?,Y®”). 
Thus, K is algebraically closed in L, as claimed. 


7.3, Exercise 3. Let K be a perfect field of characteristic p > 0, for in- 
stance take K = F,, and let X be a variable. Furthermore, consider the purely 
inseparable closure L = K(X)? ~ of K(X). Then L/K is of transcendence 
degree 1. We claim that this extension is separable, but not separably gener- 
ated. To justify this, observe that L is the union of the ascending chain of fields 
K(X)?’ = K(X”'), i EN. Since for each i, the field K(X? ‘) is purely tran- 
scendental over A with transcendence basis X? ', we conclude from 7.2 /13 and 
7.3/3 that L/K is separable. 

Let us assume now that the extension L/K is even separably generated. 
Then there exists an element « € L that is transcendental over K such that 
L is separable algebraic over K(x). Since x is contained in one of the fields 
K(X”), there is a chain of fields K(x) C K(X? “) C L. Now if L/K(a) is 
separable algebraic, the same is true for L/K(X”’) by 3.6/11. However, in 
contradiction to this, the element X p*" ig purely inseparable of degree p over 
K(X” -'). Thus, the extension L/K cannot be separably generated. 

7.4, Exercise 1. The characterization of separable extensions L/K through the 
condition Q7, ae 0 is valid only for finitely generated extensions. For example, 


co 


if K is a nonperfect field of characteristic p > 0 and L = K? ~ is its purely 
inseparable closure (or an algebraic closure), then the extension L/K is not 
separable. On the other hand, since each element of L admits a pth root in LZ, 
every derivation on L is trivial. In particular, this implies 7 jn 0. 
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N 
Zz 
Q, Qso 
R,Rso 
C 


Map(X, G) 
GX Gs) 

Ta 

aH, Ha 
G/H, H\G 
(G: H) 
ordG 

G/N 

(z) 

orda 

R* 

H 

R*,R&) 
RX] 

deg f 

RX] 

a+b 

a-6 

ano 

(Gise0+ Gy) 
R/a 

Fp 

x=y moda 
tly 

aty 

Up(a) 
gced(x1,...,2n) 
Iem(21,---,2n) 
R[M] 
R[X,...,Xn] 
R[X] 

deg f 


natural numbers, including 0 

(rational) integers 

rational numbers, resp. positive rational numbers 
real numbers, resp. positive real numbers 
complex numbers 


set of maps 12 

G-valued functions on X 12 

left translation by a 14 

coset of a subgroup H 15 

set of cosets modulo H_ 16 

index of a subgroup H_ 16 

order of a group 16 

residue class group modulo a normal subgroup NV 
subgroup generated by an element 20 
order of an element 21 

group of units of a ring 26 

Hamiltonian quaternions 27 

R-valued functions on a set X 27 
polynomial ring in one variable X 28 
degree of a polynomial 29 

ring of formal power series 31 

sum of ideals 32 

product of ideals 32 

intersection of ideals 32 

ideal generated by ay,...,a, 32 

residue class ring modulo an ideal a 36 
field with p elements 38 

congruence 40 

x divides y 43 

x doesn’t divide y 43 

exponent relative to the prime factor p 47 
greatest common divisor of 71,...,2%, 47 
least common multiple of 71,...,%, 47 
polynomial ring attached to M51 
polynomial ring in n variables 51 
polynomial ring in a family X of variables 51 
total degree of a polynomial 55 
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357 


358 


R[x] 

Df, f 
Q(R) 
K(X), K(X) 
SR, Rs 
Up(2), Up(f) 
M/N 
Tier Mi 
Dier M; 
rank M 
SM 


f° a 
Hom x (L, kK) 
[L: K]s; 
#H 


Glossary of Notation 


smallest subring containing R and x 56 
derivative of a polynomial 


field of fractions of an integral domain 60 


rational function fields 60 


localization of a ring R60 
exponents relative to the prime factor p 61 
residue class module modulo a submodule N 


sum of modules 


69 


direct sum of modules 69 
69 

localization of a module M_ 70 
length of a module 
saturation of a submodule 
content of an element 72 
t-fold exterior power of a free module 


rank of a module 


characteristic of a field 85 


field extension 87 
degree of a field extension 87 
field generated over K by a system 2 90 


field generated over K by ay,... 


70 


algebraic closure of Q 93 


ring generated over A by 2,... 


58 


71 


algebraic closure of a field 104 
polynomial transported by a 105 
set of K-homomorphisms L —> K 113 
separable degree of a field extension 113 


number of elements in a set 


113 


field consisting of gq = p” elements 


algebraic set attached to F,a 126 


vanishing ideal of U 126 


radical of an ideal 


128 


ln 


> An 


75 


90 


94 


124 


group of K-automorphisms of L 135 
Galois group of L/K 


fixed field of G 
composite field 
projective limit 
inductive limit 


ring of integral ¢-adic numbers 


136 
139 
147 

147 


135 


150 


68 


elementary symmetric polynomials in n variables 
lexicographic degree of a polynomial 


discriminant of a polynomial 
resultant of two polynomials 


167 
168 


norm of multiplication with g(x) 
trace of the multiplication by a 173 


discriminant of 71,... 


>on 


173 


group of nth roots of unity 176 
Euler’s y-function 177 
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